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Description 

The present invention relates to DNA molecules responsible for encoding the multi-functional proteins that direct 
the biosynthesis of polyketide compounds, the products encoded by said DNA molecules, recombinant DNA expression 
vectors, and transformed microbial host cells. 

Polyketides are a family of compounds that include a large number of structurally and functionally diverse natural 
products. For example, the polyketides provide the structural backbone for compounds that exhibit a variety of biological 
activities, such as, antibiotic, antitumor, and immunosuppressive agents. 

Although the polyketides are quite diverse as indicated, they share a common mechanistic scheme of biosynthesis. 
The polyketides are synthesized by the successive condensation of small carboxylic acid residues followed by variable 
reduction steps at the resulting p-keto carbon (i.e., p-carbonyl) moiety in a process that is similar to the synthesis of 
fatty acids. The iterative synthetic process for many of these polyketides is controlled by a complex of large, multi- 
functional polypeptides that have distinct sites for the variety of activities that are required. 

The general scheme for polyketide biosynthesis has been reviewed, for example, in Hopwoodand Sherman, Annu. 
Rev. Genet., 1990, 24:37 and Katz and Donadio, Annu. Rev. Microbiol., 1993, 47:875. 

Naturally-occurring DNA sequences that encode the polyketide synthase enzymes have generally been found to 
be organized into repeated subunits, or modules, each of which encodes all the activities required in a single round of 
synthesis, which includes the condensation step itself and the post-condensation processing steps. Each activity is 
associated with a distinct site, which contributes to the specificity for the particular carboxylic acid building block that 
is incorporated at each condensation step, or which dictates the particular post-condensation processing functions 
that will be executed. 

For example, PCT publication WO 93/13663 describes the organization of the gene encoding the polyketide syn- 
thase of Saccharopolyspora erythraea. The gene is organized in modules, with each module effecting one condensation 
step. The precise sequence of chain growth and the processing of the growing chain is determined by the genetic 
information in each module. This PCT application describes an approach for synthesizing novel polyketide structures 
by manipulating in several ways the DNA governing the biosynthesis of the cyclic lactone framework. In order to adapt 
this methodology to other polyketides, however, the DNA molecules directing the biosynthetic processing must first be 
isolated. 

The present invention is directed to the DNA sequence for the gene cluster responsible for encoding tylactone 
synthase, the building machinery of tylactone, which is the basic building block of tylosin. As a result, the present 
invention enables modification of the DNA sequence so as to alter the type of carboxylic acids incorporated, the number 
of carboxylic acids incorporated, the post-condensation reactions performed, or any combination thereof! thereby re- 
sulting in novel tylosin-related polyketides. 

The present invention provides a DNA molecule comprising an isolated DNA sequence that encodes a tylactone 
synthase domain. Thus, the present invention provides the DNA molecule of SEQ ID NO:1 and DNA molecules that 
contain submodules thereof. The present invention also provides the products encoded by said DNA molecules, re- 
combinant DNA expression vectors, and transformed microbial host cells. 

Figure 1 is a map of the tylactone polyketide synthase region (tylG) of the Streptomyces fradiae DNA (-45 kb). 
Distances in kb are relative to the beginning of tylG. Open reading frames (ORFs) are indicated by arrows. Restriction 
sites are denoted as follows: 

E - EcoRI 
B - SamHI 

Predicted functional domains are labeled as follows: 

ACP - acyl carrier protein 

AT - acyltransferase 

DH - dehydratase 

ER - enoylreductase 

KR - ketoreductase 

KR 1 - ketoreductase-like domain predicted to be inactive 

KS - ketosynthase 

KS' - ketosynthase-like domain in which a glutamine residue is present in the position occupied by an active site 

cysteine in a normal ketosynthase 

TE - thioesterase. 



Figure 2 depicts the biosynthetic pathway for tylactone synthesis. 
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Figure 3 is a map of the two clones that span the whole region of the tylG DNA. 

Figure 4 is a map of the srmG region of the Streptomyces ambofaciens DNA. Distances in kb are relative to the 
beginning of srmG. Open reading frames are indicated by arrows. The srmG DNA (0-42 kb) is the platenolide polyketide 
synthase region. Restriction sites are denoted as above with additions as follows: 

5 

AP - Apa\ 
G - Bgh\ 
K- Kpri 
P- Psfl 
JO X - Xho\ 

Predicted functional domains are labeled as above. 

Figure 5 demonstrates the biosynthetic pathway for platenolide synthesis. 
Figure 6 is a map of the two clones that span the whole region of the srmG DNA. 
is Polyketides are a large class of structurally and functionally diverse natural products. A common feature of com- 

pounds in this class is that their synthesis is accomplished under the direction of a complex of multi-functional peptides, 
termed a "polyketide synthase." Molecular genetic analysis of polyketide synthase genes has revealed two distinct 
classes of enzymes operating for different polyketides: 1 ) the aromatics, which are made through an essentially iterative 
process, and 2) the complex polyketides, which comprise several repeats of the same activities arranged in few very 
20 large polypeptides. 

Among the complex polyketide synthase genes, a polyketide synthase includes enzymatic and regulatory activities 
responsible for exercising substrate specificity, catalyzing the condensation of small carboxylic acid building blocks (in 
the form of coenzyme A thioesters) to a growing polyketide carbon chain, and catalyzing the post-condensation 
processing reactions at the p-carbonyl functional group that results from the condensation reaction. 

25 The condensation reaction requires several activities including acyl carrier protein (ACP), p-ketosynthase (KS), 

and acyltransf erase (AT). Once a condensation has occurred, the resulting p-carbonyl functional group may be mod- 
ified. Post-condensation activities that may be involved include p-ketoreductase (KR), dehydratase (DH), and enoylre- 
ductase (ER). Polyketide biosynthesis is terminated by a thioesterase (TE) activity. Whether all, some, or none of these 
activities act after a particular condensation step determines the structure of the final product. 

so The present invention provides, in particular, the DNA sequence encoding the polyketide synthase responsible for 

biosynthesis of tylactone, i.e., tylactone synthase. Tylactone itself is the polyketide backbone of the commercially sig- 
nificant antibiotic tylosin. The tylactone synthase DNA sequence, which defines the tylactone synthase gene cluster, 
directs biosynthesis of the tylactone polyketide by encoding the various distinct activities of tylactone synthase. 

The gene cluster for tylactone synthase, like other complex polyketide biosynthetic genes whose organization has 

35 been elucidated, is characterized by the presence of several open reading frames (ORFs), each of which contains one 
or more repeated units termed "modules." A module is defined as the genetic element encoding all of the distinct 
activities required in a single round of synthesis, i.e., one condensation step and all the p-carbonyl processing steps 
associated therewith. Each module encodes an ACP, a KS, and an AT activity to accomplish the condensation portion 
of the synthesis, and selected post-condensation activities to effect p-carbonyl processing. The polypeptides encoded 

40 by such modules are termed "synthase units" (SUs). 

Each module is further characterized by the inclusion of submodules that are responsible for encoding the distinct 
activities of tylactone synthase. For purposes of the present invention, the term "submodule" is defined as the portion 
of the polyketide synthase DNA sequence that encodes a distinct activity, or "domain". Thus, a domain is taken as 
commonly understood to mean that part of the polyketide synthase polypeptide necessary for a given distinct activity. 

4 5 Organization of the tylactone synthase gene cluster derived from Streptomyces fradiae, is shown in Figure 1 . The 

tylactone synthetic pathway, with indications of the specific carboxylic acid substrates used for each condensation 
reaction and the various post-condensation activities, is shown in Figure 2. 

A preferred DNA molecule comprising the tylactone synthase gene cluster isolated from Streptomyces fradiae is 
represented by SEQ I D NO: 1 . Other preferred DNA molecules of the present invention include the various open reading 

50 frames of SEQ ID NO:1 that encode individual multi-functional polypeptides. These are represented by the following 
nucleotide residues of SEQ ID NO:1 : ORF1 81 6 to 1 4234, ORF2 1 4351 to 1 9945, ORF3 2001 0 to 311 99, ORF4 31 232 
to 36067, and ORFS 36249 to 41774. The predicted amino acid sequences of the various peptides encoded by these 
sequences are shown in SEQ ID NO:2, 3, 4, 5, and 6, respectively. 

Yet other preferred DNA molecules of the present invention include the modules that encode the synthase units, 

55 which include all the activities necessary for a single round of synthesis. These are represented by the following nu- 
cleotide residues of SEQ ID NO:1: Starter Module 942 to 3929, Module 1 3993 to 8471, Module 2 8541 to 13970, 
Module 3 1 441 1 to 1 9666, Module 4 201 36 to 24611 , Module 5 24675 to 30902, Module 6 31 337 to 35743, and Module 
7 36360 to 40826. The predicted amino acid sequences of the various synthase units encoded by these modules are 



3 
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represented, respectively, by the following amino acid residues: Starter SU 43 to 1038, SU1 1060 to 2552 and SU2 
2576 to 4385 in SEQ ID NO:2; SU3 21 to 1772 in SEQ ID NO:3; SU4 43 to 1534 and SU5 1556 to 3631 in SEQ ID 
NO:4; SU6 36 to 1504 in SEQ ID NO:5; and SU7 38 to 1526 in SEQ ID NO:6. 

Still other preferred DNA molecules include the various submodules that encode the various domains of tylactone 
5 synthase. These submodules are represented by the following nucleotide residues: KS°(s) 942 to 2156, AT(s) 2571 
to 3557, ACP(s) 3675 to 3929, KS1 3993 to 5264, AT1 5631 to 6617, KR1 7410 to 7949. and ACP1 8220 to 8471 of 
Module 1 in SEQ ID NO:1; KS2 8541 to 9812, AT2 10260 to 11246, DH2 11319 to 11876, KR2 12861 to 13415, and 
ACP2 13719 to 13970 of Module 2 in SEQ ID NO:1; KS3 14411 to 15697, AT3 16055 to 17122, DH3 17198 to 17794, 
KR3 18584 to 19138, and ACP3 19415 to 19666 of Module 3 in SEQ ID NO:1; KS4 20136 to 21404, AT4 21771 to 

10 22757, KR'4 23541 to 24077, and ACP4 24360 to 24611 of Module 4 in SEQ ID NO: 1 ; KS5 24675 to 25949, AT5 26292 
to 27284, DH5 27360 to 27917, ER5 28767 to 29813, KR5 29829 to 30368, and ACP5 30651 to 30902 of Module 5 
in SEQ ID NO: 1 ; KS6 31 337 to 32608, AT6 32975 to 33961 , KR6 34694 to 35236, and ACP6 35492 to 35743 of Module 
6 in SEQ ID NO:1 ; K$7 36360 to 37631 , AT7 37989 to 38987, KR7 39759 to 4031 3, ACP7 40575 to 40826. and TE7 
41 235 to 41 333 of Module 7 in SEQ ID NO: 1 . 

15 The predicted amino acid sequences of the various domains encoded by these submodules are represented, 

respectively, by KS Q (s) 43 to 447, AT(s) 586 to 914, ACP(s) 954 to 1038, KS1 1060 to 1483, AT1 1606 to 1 934, KR1 
2199 to 2378, and ACP1 2469 to 2552 in SEQ ID NO:2; KS2 2576 to 2999, AT2 3149 to 3477, DH2 3502 to 3687, 
KR2 4016 to 4200, and ACP2 4302 to 4385 in SEQ ID NO:2; KS3 21 to 449, AT3 569 to 924, DH3 950 to 1148, KR3 
1 41 2 to 1 596, and ACP3 1 689 to 1 772 in SEQ I D NO:3; KS4 43 to 465, AT4 588 to 91 6, KR'4 1 1 78 to 1 356, and ACP4 

20 1451 to 1534 in SEQ ID NO:4; KS5 1556 to 1980, ATS 2095 to 2425, DH5 2451 to 2636, KR5 3274 to 3453, and ACP5 
3548 to 3631 in SEQ ID NO:4; KS6 36 to 459, AT6 582 to 910, KR6 1155 to 1335, and ACP6 1421 to 1504 in SEQ ID 
NO:5; KS7 38 to 461 , AT7 581 to 913, KR7 1171 to 1 355, ACP7 1 443 to 1 526, and TE7 1 663 to 1695 in SEQ ID NO:6. 

Although not wishing to be bound to any particular technical explanation, sequence similarity exists among domain 
boundaries in various polyketide synthase genes. Thus, one skilled in the art is able to predict the domain boundaries 

25 of newly discovered polyketide synthase genes based on the sequence information of known polyketide synthase 
genes. In particular, the boundaries of submodules, domains, and open reading frames in the instant application are 
predicted based on sequence information disclosed in the instant application and the locations of the domain bound- 
aries of the erythromycin polyketide synthase (Donadio et al., Gene 111:51 (1992)). 

The DNA sequence of the tylactone synthase gene was determined from recombinant DNA clones prepared from 

30 the DNA of Streptomyces fradiae. The tylactone synthase gene is contained in recombinant DNA vectors pSET506 
and pSET507 (Figure 3), which are available from the National Center for Agricultural Utilization Research, 1815 North 
University Street, Peoria, Illinois 61604-3999, in E. co//K12 MM294 under accession numbers NRRL B-18688 (depos- 
ited July 19, 1990) and NRRL B-18689 (deposited July 19, 1990), respectively. 

Tehcniques for isolating bacterial DNA are readily available and well known in the art. Any such technique can be 

55 employed in this invention. In particular, DNA from these cultures is isolated as follows. Lyophils of E. coli K12 
MM294/pSET506 or E. co// K12 MM294/pSET507 are plated onto L-agar (10 g tryptone, 10 g NaCI, 5 g yeast extract, 
and 1 5 g agar per liter) plates containing 1 00 ng/ml ampicillin to obtain a single colony isolate of the strain. This colony 
is used to inoculate about 500 ml of L-broth (10 g tryptone, 10 g NaCI, 5 g yeast extract per liter) containing 100 jig/ 
ml ampicillin, and the resulting culture is incubated at 37° C with aeration until the cells reach stationary phase. Cosmid 

40 DNA is obtained from the cells in accordance with procedures known in the art (see e.g., Rao et al, 1 987 in Methods 
in Enzymology, 153:166). 

DNA of the current invention can be sequenced using any known techniques in the art such as the dideoxynucle- 
otide chain-termination method (Sanger, et al., Proc. Natl. Acad. Sci. 74:5463 (1977)) with either radioisotopic or flu- 
orescent labels. Double-stranded, supercoiled DNA was used directly for templates in sequence reactions with se- 

45 quence-specific oligonucleotide primers. Alternatively, fragments were used to prepare libraries of either random, over- 
lapping sequences in the bacteriophage M13 or nested, overlapping deletions in a plasmid vector. Individual recom- 
binant DNA subclones were then sequenced with vector-specific oligonucleotide primers. Radioactive reaction products 
were electrophoresed on denaturing polyacrylamide gels and analyzed by autoradiography. Fluorescent- labeled re- 
action products were electrophoresed and analyzed on Applied Biosystems (ABI Division, Perkin Elmer, Foster City, 

50 CA 94404) model 370A and 373A or DuPont (Wilmington, DE) Genesis DNA sequencers. Sequence data was assem- 
bled and edited using Genetic Center Group (GCG, Madison, Wl ) programs Gel Assemble and Seqed or the ABI model 
670 Inherit Sequence Analysis system and the AutoAssembler and SeqEd programs. 

Polypeptides corresponding to a domain, a synthase unit, or a multi-functional polypeptide can be produced by 
expression of the cDNA sequence in a bacteria, for example, using known expression vectors. Alternatively, the 

55 polypeptides mentioned above can be extracted from tylactone-producing bacteria such as Streptomyces frad'tae. In 
addition, the techniques of synthetic chemistry can be employed to synthesize the polypeptides mentioned above. The 
procedures and techniques for isolation and purification of homogenous protein or polypeptides are well known in the 
art. 



4 
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Since the genetic organization of the tylactone synthase gene cluster appears to correspond to the order of the 
reactions required to complete synthesis of tylactone, knowledge of the tylactone synthase DNA sequence, its genetic 
organization, and the activities associated with particular open reading frames, modules, and submodules of the gene 
enables production of novel polyketides having a predicted structure that are not otherwise available. Modifications 
may be made to the DNA sequence that either alter the initial carboxylic acid building block used or the building block 
added at any of the condensation steps. The tylactone synthase gene may also be modified to alter the actual number 
of condensation steps done, thereby changing the size of the carbon backbone. Modifications to portions of the DNA 
sequence that encode the post condensation processing activities will alter the functional groups appearing at the 
various condensation sites on the carbon chain backbone. 

These modifications can be accomplished by substituting submodules derived from the tylactone synthase gene 
and having known activities for corresponding submodules from another polyketide synthase gene having different 
activities. Submodules from tylactone synthase may also be combined with submodules from other polyketide synthase 
genes to effect additional catalytic steps. Accordingly, a DNA molecule wherein at least one submodule from the ty- 
lactone synthase gene has been combined with, or substituted for, submodules from the DNA sequence of other 
polyketide synthase genes is also provided by the present invention. Further, submodules that are a part of the present 
invention may be selectively inactivated thereby giving rise to predictable novel polyketide structures. 

For example, a submodule encoding a KS' activity, an AT activity, and an ACP activity, all derived from the first 
open reading frame of the tylactone synthase gene (ORF1), were exchanged for the corresponding submodule in the 
platenolide synthase gene (see Example 2). Platenolide is the polyketide backbone of the antibiotic spiramycin. 

The submodule from the tylactone synthase gene encodes a domain that catalyzes the incorporation of a propi- 
onate as the initial building block whereas the corresponding submodule of the platenolide synthase gene encodes a 
domain that catalyzes the incorporation of an acetate building block. When the resulting DNA molecule was placed 
into a bacterial strain and grown under conditions promoting polyketide synthesis, a hybrid molecule having the struc- 
ture that would be predicted by incorporation of an additional methyl side-chain at the start of the growing polyketide 
chain was synthesized, in particular, 16-methyl platenolide. 

One skilled in the art is fully familiar with the degeneracy of the genetic code. Consequently, the skilled artisan can 
modify the specific DNA sequences provided by this disclosure to provide proteins having the same or improved char- 
acteristics compared to those polypeptides specifically provided herein. Also, one skilled in the art can modify the DNA 
sequences to express an identical protein to those provided, albeit expressed at higher levels. Furthermore, one skilled 
in the art is familiar with means to prepare synthetically, either partially, or in whole, DNA sequences which would be 
useful in preparing recombinant DNA vectors or coding sequences which are encompassed by the current invention. 
Additionally, recombinant means for modifying the DNA sequences provided may include for example site<Jirected 
deletion or site-directed mutagenesis. These techniques are well known to those skilled in the art and require no further 
elaboration here. Consequently, as used herein, DNA which is isolated from natural sources, prepared synthetically 
or semi-synthetically, or which are modified by recombinant DNA methods, are within the scope of the present invention. 

Likewise, those skilled in the art will recognize that the polypeptides of the invention may be expressed recom- 
binant!^ Alternatively, these polypeptides may be synthesized as well, either in whole or in part, by conventional known 
non-recombinant techniques; for example, solid-phase synthesis. Thus, the present invention should not be construed 
as necessarily limited to any specific vector constructions or means for production of the specific polyketide synthase 
molecules exemplified. These alternate means for preparing the present polypeptides are meant to be encompassed 
by the present invention. 

Many cyclized polyketides undergo glycosylation at one or more sites. Tylosin is a 16-membered cyclic lactone, 
tylactone, with three attached sugar residues. The process of converting tylactone to tylosin is will known in the art! 
The present invention also provides the information needed to synthesize novel tylosin -related polyketides based on 
tylactone. The principles have already been described above. I n addition , any product resulting from post-transcriptional 
or post-translational modification in vivo or in vitro based on the DNA sequence information disclosed herein are meant 
to be encompassed by the present invention. 

The following examples are provided for exemplification purposes only and are not intended to limit the scope of 
the invention which has been described in broad terms above. 



EXAMPLE 1 



The DNA sequence of the Streptomyces fradiae tylactone synthase gene, tylG, was obtained by sequencing the 
inserts of recombinant DNA subclones containing contiguous or overlapping DNA segments, which when considered 
cumulatively span the entire region of fy/G. All sequences representing tylG are fully contained in the overlapping 
cosmid subclones pSET506 and pSET507. 

In accord with the current invention, the sequence may now be obtained by subcloning and sequencing the DNA 
fragment designated by EcoRI restriction sites at 3.0 and 4.0 kb on the map presented in Figure 3, the fragment bounded 
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by the EccR\ site at 4.0 kb and the BamHl site at 7.5 kb, the fragments bounded by BamH\ sites at 7 5 12 0 23 4 
27.3, 34.6, 39.5, and 50.5 kb. 

In order to obtain the tylG gene on a single DNA fragment, the 31 .6 kb DNA fragment bounded by the EcoH\ site 
at 3.0 kb and the BamH\ site at 34.6 kb is isolated from a partial digestion of pSET506 with the restriction enzymes 
EcoRI and BamHl. The 15.9 kb DNA fragment bounded by the BamH\ sites at 34.6 and 50.5 kb is isolated from a 
partial digestion of pSET507 with the restriction enzyme BamHl. The resulting fragments are ligated and cloned in an 
appropriate recombinant DNA vector. Clones containing the correct orientation of the two ligated fragments are iden- 
tified by restriction enzyme site mapping. 

EXAMPLE 2 

Production of a polyketide hybrid of platenolide and tylactone 

The lactone rings of the polyketides platenolide and tylactone undergo an identical set of post-condensation 
processing steps. However, these two polyketides are synthesized by condensation of more than one type of carboxylic 
acid and the specific building blocks chosen for incorporation in the two pathways differ. The organization of the pla- 
tenolide gene cluster is shown in Figure 4 along with the accompanying synthetic pathway in Figure 5. The specific 
carboxylic acid substrates that are used for each condensation reaction and the posKondensation activities of plate- 
nolide synthesis are indicated. 

The DNA sequence of the Streptomyces ambofaciens platenolide synthase (srmG) genes was obtained by se- 
quencing inserts of recombinant DNA subclones containing contiguous or overlapping DNA segments, which when 
considered together span the entire srmG region. All sequences representing srmG are fully contained in the overlap- 
ping cosmid clones pKC1080 and pKC1 306 (Figure 6). The sequence can be obtained by subcloning and sequencing 
the fragments bounded by Nru\ sites at position 1 , 0. 3 kb, 8.2 kb, 1 4. 1 kb, 20.2 kb, 29.5 kb, 31 .4 kb, 41 . 1 kb, and 42.0 kb. 

In order to obtain the srmG region on a single fragment, the 25.0 kb fragment bounded by the NnA site at position 
1 and the SfiA site at 25.0 kb is isolated from a partial digestion of pKC1080 with restriction enzymes Nrti and Sfu\. 
The 17.8 kb DNA fragment bounded by the Sfu\ sites at 25.0 kb and 42.8 kb is isolated by digestion of pKC1 306 with 
the restriction enzyme SfiA. The resulting fragments are ligated and cloned in an appropriate recombinant DNA vector. 
Clones containing the correct orientation of the two ligated fragments are identified by restriction enzyme mapping. 

An exchange of tylactone polyketide synthase DNA with corresponding platenolide DNA was effected as follows 
in order to generate a novel polyketide structure. 

A strain of Streptomyces ambofaciens, the organism that produces platenolide, was constructed with most of ORF 1 
deleted. This ORF1 -deficient strain produced no detectable platenolide. To confirm that the lack of ORF1 was the only 
deficiency in platenolide production, a construct containing ORF1 , and not any functional activities of ORFs 2-5, was 
introduced into the ORF1 -deficient strain on a vector that contains the site-specific integration function from the strep- 
tomycete phage *C31 . Integration of ORF1 at the <j>C31 att site restored spiramycin production to parental levels, 
confirming that ORF1 codes for a functional protein and that ORFs 2-5 are expressed in the ORF1 -deficient strain. 

ORF1 of tylG is organized like s/vnGORFI . DNA coding for a KS\ an AT, and an ACPfrom <y/ORF1 was exchanged 
with the corresponding region from srm ORF1 by replacing an EcoR\-Apa\ fragment of srm ORF1 with an EcoR\-SftA 
fragment from ry/ORF1 , generating plasmid pKC 1 524. When pKC 1 524 was introduced into the ORF1 -deficient strain, 
polyketide synthesis was restored. The products produced by this restored strain were indistinguishable from those 
produced by the parental strain of Streptomyces ambofaciens when analyzed by thin-layer chromatography and HPLC. 

The hybrid ORF1 DNA sequence is SEQ ID NO:7 and the amino acid sequence of the ORF1 -encoded polypeptide 
is SEQ ID NO:8. The rest of the DNA sequence of the hybrid molecule is identical to original tylG sequence and the 
polypeptides encoded by this remaining portion are therefore identical as well. 

In order to simplify physical analysis of the putative novel polyketide, an Xho\ fragment encoding sugar synthesis 
or addition was deleted from the restored. Deletion of this fragment led to a strain that produced only the lactone without 
any sugar residues added. When analyzed by NMR and mass spectroscopy, the isolated product made by this sugar- 
deficient strain was confirmed as 16-methylplatenolide, which is the structure that would be predicted from the incor- 
poration of a propionate in place of the acetate normally utilized in platenolide synthesis. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



fi> APPLICANT: ELI LILLY AND COMPANY 

(B) STREET: Lilly Corporate Center 

(C) CITY: Indianapolis 

(D) STATE: Indiana 

(E) COUNTRY: United States of America 

(F) ZIP : 46285 



(ii) TITLE OF INVENTION: POLYKETIDE SYNTHASE GENES 

(iii) NUMBER OF SEQUENCES: 15 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: K. G. TAPPING 

(B) STREET: Erl Wood Manor 
(C> CITY: Windlesham 

( D ) STATE : Sur r ey 

(E) COUNTRY: United Kingdom 
IF) ZIP: GU20 6PH 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: Macintosh 7.0 

(D) SOFTWARE: Microsoft Word 5.1 



2) INFORMATION FOR SEQ ID NO:l; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 816.. 14234 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION : 14351.. 19945 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 20010.. 31199 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 31232.. 36067 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 36249.. 41774 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAATTCGCGG TCCGCTCGGG TTCCGCTCCG TTTTCTGCTT CGAGCCTCTC TGTCGTCATC 

TCGGCTCTCT CATCGGGCTC GTGCGGGAAG GCATCCCGGT GCGGGGAGAC ATCCTCCTCC 

GGGAAGGAAT CCTGGTGCGG GAAGGCAACG ACTGCGGGAC GCGGGAGAAA GGGGAATCGG 

CGGGAATTTC CCCCGCGCGG CGGGGACGGT GCCGGAGAAC AACGGCGGGG AAACAGCCCG 

CGGTCCGTGA CGACAACGGA AACTATGGTC CGCTTCCTCC GTCCACAAGG CGGAACCTGA 

CATAGTCCCC GCCCACGCGG AAATCCCGCA CGGCGGCCCG CCGGCCGCCG CACCGGACCT 

GACATAGCCT CGCCGGACCG CTCCGGTGCG GCCACCCCGT TGGTCTTGGG TGATGAGGTA 

CCGGATCAGA GGAGAAAGCA CCATCCCCCG CCCCTCGGCC AGCGAACCGC GCGGGACCAC 

CCGTTCGGCG ACCGCGCTGG CGCGCCGCCG TCGACCGGGC CGTAACTCCC CTCCGCCATC 

GAATACITCG CCCCTCGAAT CCCTCACCGG GCGAGTTCCA GGACCGCCCC TCGCTCTCGC 

CATACCGGAG AACGAACCCG AACGGCACGG CGGAAAGCCC GTCCGCAATC CCCGGGACAT 

TCCTGTGACC CGACAACACG CTTTGCCGAC ACGGTTCGCC GCAACCCTTG TTGGCAGGCT 

CACCGGCACG GCCCGCTGAC ACAGCTCGGT GACACGGCAG CCTGACGGGA AACCGCCGAA 

GCCTCTCGAG TCCTCGCACA TTCCGGAGAG AACAG GTG TCT TCC GCG CTC CGG 

Met Ser Ser Ala Leu Arg 
1 5 

CGC GCG GTG CAA TCC AAC TCT GGC TAC GGA GAC CTC ATC ACC TCG AAC 
Arg Ala Val Gin Ser Asn Cys Gly Tyr Gly Asp Leu Met Thr Ser Asn 
10 15 20 

ACC GCT GCA CAG AAC ACC GGC GAC CAG CAA GAC CTC GAC OCT CCC GAC 
Thr Ala Ala Gin Asn Thr Gly Asp Gin Glu Asp Val Asp Gly Pro Asp 
25 30 35 

AGC ACA CAC GGT GGG GAG ATC GCC GTC GTG GGA ATG TCG TGC CGT CTC 
Ser Thr His Gly Gly Glu He Ala Val Val Gly Met Ser Cys Arg Leu 
40 45 50 

CCG GGC GCG GCC GGT GTC GAG GAA TTC TCG GAA CTC CTC CGC AGC GGA 
Pro Gly Ala Ala Gly Val Glu Glu Phe Trp Glu Leu Leu Arg Ser Gly 
55 60 65 70 
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CGC GGT ATG CCC ACC CGT CAG GAC GAC GGC ACC TGG COG GCC GCC CTC 1073 

Arg Gly Met Pro Thr Arg Gin Asp Asp Gly Thr Trp Arg Ala Ala Leu 
75 80 85 

5 

GAG GAC CAC GCC GGC TTC GAC GCC GGG TTC TTC GGC ATG AAC GCC CGG 1121 

Glu Asp His Ala Gly Phe Asp Ala Gly Phe Phe Gly Met Asn Ala Arg 
90 95 100 

70 CAG GCC GCC GCC ACC GAC CCG CAG CAC CGA CTG ATG CTG GAA CTC GGA 1169 

Gin Ala Ala Ala Thr Asp Pro Gin His Arg Leu Met Leu Glu Leu Gly 
105 110 115 

TGG GAG GCA CTG GAG GAC GCG GGC ATC GTC CCC GGC GAT CTC ACC GGC 1217 
Trp Glu Ala Leu Glu Asp Ala Gly He Val Pro Gly Asp Leu Thr Gly 
15 120 125 130 

ACC GAC ACC GGA GTC TTC GCC GGC CTG GCG TCC GAC GAC TAT GCC GTT 1265 
Thr Asp Thr Gly Val Phe Ala Gly Val Ala Ser Asp Asp Tyr Ala Val 
135 140 145 150 

20 

CTC ACC CGC CGT TCC GCC GTC TCC GCC GGG GGA TAC ACC GCC ACG GGG 1313 
Leu Thr Arg Arg Ser Ala Val Ser Ala Gly Gly Tyr Thr Ala Thr Gly 
155 160 165 

CTG CAC CGC GCC CTG GCC GCC AAC CGC CTC TCC CAC TTC CTG GGC CTG 1361 
Leu His Arg Ala Leu Ala Ala Asn Arg Leu Ser His Phe Leu Gly Leu 
170 175 180 

CGC GGC CCC AGC CTG GTC GTC GAC TCG GCC CAG TCC GCC TCA CTG GTG 1409 
Arg Gly Pro Ser Leu Val Val Asp Ser Ala Gin Ser Ala Ser Leu Val 
30 185 190 195 

GCC GTC CAG CTC GCC TGC GAG AGT CTG CGC CGG GGT GAG ACG TCG CTC 1457 
Ala Val Gin Leu Ala Cys Glu Ser Leu Arg Arg Gly Glu Thr Ser Leu 
200 205 210 



25 
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40 
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50 
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GCC CTC GCG GGC OCT GTC AAC CTC ATC CTC ACC GAG GAG AGC ACC ACC 1505 
Ala Val Ala Gly Gly Val Asn Leu He Leu Thr Glu Glu Ser Thr Thr 
215 220 225 230 

GTC ATG GAG CGT ATG GGA GCG CTC TCA CCC GAC GGC CGC TGC CAC ACC 1553 
Val Met Glu Arg Met Gly Ala Leu Ser Pro Asp Gly Ary Cys His Thr 
235 240 245 

TTC GAC GCC CGC GCC AAC GGC TAC GTA CGC GGC GAG GGC GGC GGA GCC 1601 
Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg Gly Glu Gly Gly Gly Ala 
250 255 260 

GTC GTG CTC AAG CCA CTG GAC GCC GCA CTC GCC GAC GGC GAC CGC GTG 1649 
Val Val Leu Lys Pro Leu Asp Ala Ala Leu Ala Asp Gly Asp Arg Val 
265 270 275 

TAC TGC GTC ATC AAG GGA GGT GCC GTC AAC AAC GAC GGC GGC GGC GCG 1697 
Tyr Cys Val He Lys Gly Gly Ala Val Asn Asn Asp Gly Gly Gly Ala 
280 285 290 

AGC CTC ACC ACT CCC GAC CGG GAG GCG CAG GAA GCT GTG CTG CGC CAG 1745 
Ser Leu Thr Thr Pro Asp Arg Glu Ala Gin Glu Ala Val Leu Arg Gin 
295 300 305 310 
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GTC ACC GGT GCG GTT CTC GAC QGT GGT GTG GTT GTC GGT GCT GCC CCC 2513 
Val Thr Gly Ala Val Leu Asp Gly Gly Val Val Val Gly Ala Ala Pro 
555 560 565 

GGC GGT GCC GGT GCT GCC GCT GGT GCC GGT GCT GCC GGT GGT GCC GGT 2561 
Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly 
570 575 580 

GGT GGG GGC GTG GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TOG GTG 2609 
Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val 
585 590 595 



GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG 2657 
Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser 
75 600 605 610 

ATG CGG GAG TGT GCG CGG GCG CTG AGT GTT CAT GTG GGG TGG GAT TTG 2705 
Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp Asp Leu 
615 620 625 630 

20 

CTG GAG GTG GTG TCG GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTG 2753 
Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val 
635 640 645 

CAG CCG GTG ACG TGG GCG GTG ATG GTG TCG CTG GCC CGG TAC TGG CAG 2801 
Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin 
650 655 660 

GCG ATG GGT GTG GAC GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG 2849 
Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin Gly Glu 
30 665 670 675 

ATC GCC GCT GCC ACG GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG 2897 
lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala 
680 685 690 

35 GCT GTG GTC GCT CTG CGG GCG GGG TTC ATT GGC CGG TAT CTG GCG GGT 2945 

Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly 
695 700 705 710 

CGT GCT GCG ATC GCG GCT GTT CCG CTC CCT GCC GGC GAG GTC GAG GCC 2993 
40 Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala 

715 720 725 

GGG CTG GCG AAG TGG CCG GGT GTG GAG GTC GCG GCG GTC AAC GGT CCG 3041 
Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Aan Gly Pro 

45 730 735 740 

GCG TCT ACG GTG GTT TCC GGG GAT CGG CGG GCG GTG GCC GGT TAT GTG 3089 
Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val 
745 750 755 

GCC GTC TGT CAG GCG GAG GGT GTG CAG GCC CGG TTG ATA CCG GTG GAC 3137 
Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He Pro Val Asp 
760 765 770 

TAC GCC TCT CAC TCC CGC CAT GTG GAG GAC CTG AAG GGC GAG TTG GAG 3185 
55 Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu 

775 780 785 790 



11 



10 



20 



25 



35 



40 



45 



SO 



55 



EP 0 791 655 A2 



CGG GTG CTG TCC QGT ATC CGC CCC CGC AGT CCG CGG GTG CCG GTG TGT 3233 
Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro Val Cys 
^95 800 805 

TCC ACC GTC GCC GGA GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG 3281 
Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly 
810 815 820 

TAT TGG TTC CGT AAT CTG CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC 3329 
Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val 
825 830 835 



GGT GGT TTG TTG GAG GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC 3377 
Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val Ser Ala 
15 840 845 8S0 

CAC CCG GTA CTC GTC CAT GCG ATC GAG CAG ACC GCC GAG GCC GCG GAC 3425 
His Pro Val Leu Val His Ala He Clu Gin Thr Ala Glu Ala Ala Asp 
855 860 865 870 



CGG AGT GTC CAT GCC ACC GGG ACC CTG CCC CGC CAG GAC GAC AGC CCG 3473 
Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro 
87 5 880 885 

CAC CGC CTG CTG ACC TCC ACC GCC GAG GCC TGG CCC CAC GGC GCC ACC 3521 
His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr 
890 895 900 

^ £5 C 1°° °AC CCC GCC CTG CCC CCA GGC CAC CTC ACC ACC CTC CCC 3569 
Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro 
30 905 910 915 

ACC TAC CCC TTC AAC CAC CAC CAC TAC TGG CTC GAC ACC ATT GAC GGG 
Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr lie Asp Gly 

GGC GGA OCW CAC GAC GCG ACC CAC GAG AAG CAG ACC GGC CCT CTC ACG 3665 
Gly Gly Gly Asp Asp Ala Thr Gin Glu Lys Glu Ser Gly Pro Leu Thr 
935 540 945 950 

<^^^cm<^cmccarccrcTC^M^cjjiCTG<^TKciG 3713 

Arg Olu Leu Arg Oly Leu Pro Ser Ser Gin Lye Gin Leu Gly Phe Leu 



955 960 



965 



3617 



3809 



CTC GAT CTG GTG TCC CGG CAC ACG GCC GTC GTA CTC GGC CTG GAC ACG 3761 
Leu Asp Leu Val Cys Arg His Thr Ala Val Val Leu Gly Leu Asp Thr 
970 975 9 8 o 

GCC GCC GAG GTG GAC CCG GAC CTG TCC TTC AAG AAG CAG GGC ATC CAG 
Ala Ala Glu Val Asp Pro Asp Leu Ser Phe Lys Lys Gin Gly He Gin 
985 990 995 

TCC ATG ACC GGC GTC GAG CTG CGC AAC AGG CTC CTG ACC GAG ACC GGC 

S * r ^L Thr Gly Val Glu Leu **9 Le « Leu Thr Glu Thr Gly 

1000 1005 1010 

CTG GCA TTC CCC ACC ACC CTC GTC TAC GAC CGG CCC ACC CCT CGC GCC 
Leu Ala Leu Pro Thr Thr Leu Val Tyr Asp Arg Pro Thr Pro Arg Ala 
1015 "20 1025 1030 
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CTC GCG CAG TTC CTC CAC ACC GAG TTG CTC GAC GGC TCC CCC TCG GGC 
Leu Ala Gin Phe Leu His Thr Glu Leu Leu Asp Gly Ser Pro Ser Gly 
1035 1040 1045 

TCC GTC CTC GCA CCG GCG CAG AAG AGC TTC GAA GCC CAG GAG CCG ATC 
Ser Val Leu Ala Pro Ala Gin Lys Ser Phe Glu Ala Gin Glu Pro He 
1050 1055 1060 

CCG GTC CTG GGT ATG GGC TGC CGG TTC CCC GGT GGG GTG GGT TCG CCG 
Ala Val Val Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro 
1065 1070 1075 

GAG GCG TTG TGG CGG TTG GTG GTG GAG GGG GTG GAC GCG GTT TCC CCG 
Glu Ala Leu Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro 
1080 1085 1090 

TTT CCC GGT GAT CGT GGC TGG GAT GTG GAG GGG TTG TAC GAC CCG GAG 
Phe Pro Gly Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu 
1° 95 HOO H05 mo 

CCG GGT GTG GCG GGG AAG TCG TAT GTG CGG GAG GGG GGT TTT CTG CAT 
Pro Gly Val Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu His 
1115 1120 H25 

GAT GCG GCG GAG TTC GAT GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG 
Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu 
1130 H35 U40 

GCG GTG GCG ATG GAT CCG CAG CAG CGG CTG TTG CTG GAG ACC TCC TOG 
Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp 
1145 H50 H55 

GAG GCG ATC GAG CGG GCG GGT ATC GAC CCG CAC TCG CTG CAC GGC AGC 
Glu Ala He Glu Arg Ala Gly He Asp Pro His Ser Leu His Gly Ser 
1160 H65 H70 

CGC ACC GGC GTC TAC GCC GGC GTG ATC CCG CAG GAA TAC GGA CCT CGG 
Arg Thr Gly Val Tyr Ala Gly Val Met Pro Gin Glu Tyr Gly Pro Arg 
1175 H80 H85 U90 

CTC GCC GAA GGA GCG GAA GGC AGC GAC GGC TAC CTC CTC ACC GGT ACG 
Leu Ala Glu Gly Ala Glu Gly Ser Asp Gly Tyr Leu Leu Thr Gly Thr 
1195 1200 1205 

TCG GGG AGT GTG GTG TCG GGG CGT GTG GCC TAC ACG CTG GGG CTG GAG 
Ser Gly Ser Val Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu 
121° 1215 1220 

CGT CCG GCC GTC ACC GTC GAT ACG GCG TGT TCG TCG TCG TTG GTC GCG 
Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala 
1225 1230 1235 

TTG CAT CTG GCG GTG CAG GCG TTG CGG GGT GGC GAG TGT GAC ATC GCG 
Leu His Leu Ala Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala 
1240 1245 1250 

TTC GCC GGT GGT GTG ACC GTG ATG GCC GGG CCG GGG ATC TTC GTC GAG 
Leu Ala Gly Gly Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu 
1255 1260 1265 1270 
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TTT TCG CGG CAC CGG GGG TTC CCG GCC GAT GGG CGG TCC AAG GCG TTC 
Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe 
1275 1280 12 85 

GCG GAT GGG GCG GAT GGG ACC GCT TCG GCC GAG GGT GCG GGG GTG GTC 
Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val 
1290 1295 1300 

CTG CTG GAG CGG TTC TCG GAT GCC CGG CGG TTG GGG CAT CCG GTC TTG 
Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu 
1305 1310 1315 

GCG GTG GTG TGT GGG TCG GCG GTC AAT CAG GAC GGT GCG TCG AAT GGT 
Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
1320 1325 1330 

TTG ACG GCG CCG AGT GGT CCG TCG CAG GAG CGG GTG ATT CGT CAG GCG 
Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val lie Arg Gin Ala 
1335 1340 1345 i 350 

TTG GGG AAT GCG CGG TTG ACG GTC GCG GAT GTG GAT GTG GTC GAG GCG 
Leu Gly Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala 
1355 1360 13 6 5 

CAT GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG GCG CAG GCG TTC 
His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu 
1370 1375 1380 

CTG GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT CCG GTG TOG TTC 
Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu 
1385 1390 1395 

GGG TCG TTC AAG TCG AAT ATT GGT CAT GCT CAG GCG GCT GCG GGG GTC 
Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val 
1400 1405 1410 

GCT GGT GTC ATC AAG ATC GTC TTC GCG ATG CGG TAT GGG TCG TTC CCG 
Ala Gly Val He Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro 
1415 1420 1425 i4 30 

CGG ACG TTC CAT CTG GAT GAG CCG AGC CGG CAT GTC GAC TCG TCG GCT 
Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala 
1435 1440 1445 

GGT GGT GTC TCG TTC CTG ACC GAG GCG CGG GAG TCG CCG GGG GTC GAC 
Gly Gly Val Trp Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp 
1450 1455 1460 

CGG CCG CGT CCG GCG GCC GTC TCC GCC TTT GGT GTC AGT GGT ACC AAC 
Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn 
1465 1470 1475 

GCC CAT CTG ATC CTC GAA GCC CCC GAC ACC GCC GAG GCG GAG AGC GCC 
Ala His Leu He Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala 
1480 1485 1490 

ACG ACC CCG GTC CGC TCT GAG GTC TCG GAG TCT GCT GCG GTC CTC GAT 
Thr Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala Ala Val Leu Asd 
1495 1500 1505 isio 
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GCC CGC ACT GGT GTG GTG CCG GTG GTG GTT TCG GGG CGT TCG CGG GTG 5393 
Ala Arg Ser Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val 
1515 1520 1525 

GTG GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG GTG GAG GCC GGT 5441 
Val Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly 
1530 1535 1540 

GGT GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG GGC CGG TCG CGG 5489 
Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg 
1545 1550 1555 



TTT GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG GCT GAG CTT GCC 5537 
Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala 
15 1560 1565 1570 

GGG CGT TTG CGG GCG TTG GCG GGG GGT GAT CCG GAC GCG GGT GTG GTC 5585 
Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val 
1575 1580 1585 1590 

20 

ACC GGT GCG GTG GTG GAC CCG GAG ACG GGG TCC GGT GGT GGG GGC GTG 5633 
Thr Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val 
1595 1600 1605 

GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG 5681 
25 Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala 

1610 1615 1620 

GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TOG ATG CGG GAG TGT 5729 
Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys 
30 1 62 5 1 63 0 1 6 3 5 

GCG CGG GCG CTG AGT GTT CAT GTG GAG TGG GAT TTG CTG GAG GTG GTG 5777 
Ala Arg Ala Leu Ser Val His Val Glu Trp Asp Leu Leu Glu Val Val 
1640 1645 1650 

TCG GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTG CAG CCC GTG ACG 5825 
Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr 
1655 1660 1665 1670 

TGG GCG GTG ATG GTG TCG CTG GCC CGG TAC TGG CAG GCG ATG GGT GTG 5873 
40 Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val 

1675 1680 1685 

GAC GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG ATC GCT GCT GCC 5921 
A*p Val Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala 
45 1690 1695 1700 

ACG GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT 5969 
Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala 
1705 1710 1715 

SO CTG CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATG 6017 

Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met 
1720 1725 1730 

GCG GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG 6065 
55 Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys 

1735 1740 1745 1750 
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TGG CCG OCT GTG GAG GTC GCG GCG GTC AAC GGT CCG GOG TCC ACG GTC 
Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val 
1755 1760 1765 

GTT TCC GGG GAT CGG CGG GCG GTG GCC GGT TAT GTG GCC GTC TGT CAG 
Val Ser Gly Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin 
1770 1775 1780 

GCG GAG GGT GTG CAG GCT CGG TTG ATA CCG GTG GAC TAC GCC TCT CAC 
Ala Glu Gly Val Gin Ala Arg Leu lie Pro Val Asp Tyr Ala Ser His 
1785 1790 1795 

TCC CGC CAT GTG GAG GAC CTG AAG GGC GAG TTG GAG CGG GTG CTG TCC 
Ser Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser 
1800 1805 1810 

GGT ATC CGC CCC CGC AGT CCG CGG GTG CCG GTG TGT TCC ACC GTC GCC 
Gly lie Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala 
1815 1820 1825 1830 

GGA GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG TAT TCG TTC CGT 
Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg 
1835 1840 1845 

AAT CTG CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTG TTC 
Asn Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu 
1850 1855 i860 

GAG GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC 
Glu Glu Gly His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu 
1865 1870 1875 

GTC CAT GCG ATC GAG CAG ACG GCC GAG GCC GCG GAC CGG AGT CTC CAT 
Val His Ala He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His 
1880 1885 1890 

GCC ACC GGG ACC CTC CGC CGC CAG GAC GAC AGC CCG CAC CGC CTC CTC 
Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu 
1895 1900 1905 1910 

ACC TCC ACC GCC GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TCC GAC 
Thr Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu TTir Trp Asp 
1915 1920 1925 

CCC GCC CTC CCC CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC 
Pro Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe 
1930 1935 1940 

AAC CAC CAC CAC TAC TCG CTC GAC ACC ACC CCC ACC ACC CCC GCG ACG 
Asn His His His Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr 
1945 1950 1955 

ACC ACC CAG AGC CCC ACC GAT GCC TGG CGC TAC CGC GTC ACC TCG AAA 
Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg Tyr Arg Val Thr Trp Lvs 
I960 1965 197Q r 1 

GCC CTG ACC GAA TCC TCC CCC GTC CGC CCT CAC TCC ATC GGT CGC TCC 
Ala Leu Thr Glu Ser Ser Pro Val Arg Pro His Ser He Gly Arg Cvs 
1975 1980 1985 1990 
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CTC CTC GTT GCA CCC CCG ACC ACC GAC GGC GAG CTC CTC GAC OGA CTG 
Leu Leu Val Ala Pro Pro Thr Thr Asp Gly Glu Leu Leu Asp Gly Leu 
1995 2000 2005 

ACA ACG GTG TTG TCC GAG CGC GGT GCC TCC GTC GCC CGC CTT GAG GTG 
Thr Thr Val Leu Ser Glu Arg Gly Ala Ser Val Ala Arg Leu Glu Val 
2010 2015 2020 

CCC ATC GGC GCG CGG CGT GCC GAG GTC GCC GAA CTG CTC AAG CCC TCC 
Pro He Gly Ala Arg Arg Ala Glu Val Ala Glu Leu Leu Lys Pro Ser 
2025 2030 2035 

ATG GAG TCA GCG GGG GAG GAG AAC ACC ACC GTC GTC TCG CTT CTC GGT 
Met Glu Ser Ala Gly Glu Glu Asn Thr Thr Val Val Ser Leu Leu Gly 
2040 2045 2050 

CTG GTG CCC TCC ACG GAC GCG GTC AGG ACG TCG ATA GCG CTC CTC CAG 
Leu Val Pro Ser Thr Asp Ala Val Arg Thr Ser He Ala Leu Leu Gin 
2055 2060 2065 2070 

GCG GTC TCC GAC ATC GGC GTC CCG GCC GCC AGG GTC TGG GCG CTG ACG 
Ala Val Ser Asp He Gly Val Pro Ala Ala Arg Val Trp Ala Leu Thr 
2075 2080 2085 

CGG AGG GCC GTG GCC GTC GTT CCC GGG GAG ACG CCG CAG GAC GCG GGG 
Arg Arg Ala Val Ala Val Val Pro Gly Glu Thr Pro Gin Asp Ala Gly 
2090 2095 2100 

GCC CAG TTG TCG GGC TTC GGA CGA GTG GCG GCC CTT GAA CTC CCG GAT 
Ala Glu Leu Trp Gly Phe Gly Arg Val Ala Ala Leu Glu Leu Pro Asp 
2105 2110 2115 

ATC TGG GGC GGC TTG ATC GAT CTG CCG GAG ACA GCG GAG CTG ACG CGG 
He Trp Gly Gly Leu He Asp Leu Pro Glu Thr Ala Glu Leu Thr Ara 
2120 2125 2130 

ACG CCG GAG ACC TCA CAG CCC CCA CAG ACC CCG GAG AGG CTC CCC CAG 
Thr Pro Glu Thr Ser Gin Pro Pro Gin Thr Pro Glu Arg Leu Pro Gin 
2135 2140 2145 2150 

ACT CCG AAC CGA CGC GCC CTT GAG CTT GCT GCC GCC GTC CTC GCC GGC 
Thr Pro Asn Arg Arg Ala Leu Glu Leu Ala Ala Ala Val Leu Ala Gly 
2155 2160 2165 

CGC GAC GGC GAG GAC CAG GTC GCC GTC CGC GCC TCG GGG ATC TAC GGG 
Arg Asp Gly Glu Asp Gin Val Ala Val Arg Ala Ser Gly He Tyr Gly 
2170 2175 2180 

CGG CGG GTG TCG CGG GCC GCG GCA GCG GGG GCC GCC TCC TCG CAG CCG 
Arg Arg Val Ser Arg Ala Ala Ala Ala Gly Ala Ala Ser Trp Gin Pro 
2185 2190 2195 

TCC GGC ACG GTG CTG ATC ACC GGC GGC ATG GGT GCC ATC GGC AGG CGG 

.iL Thr Val LeU Ile G1 * G1 * Met G1 V Ala lie Gly Arg Arg 
2200 2205 2210 

CTC GCC CGC AGG CTC GCG GCC GAG GGA GCC GAA CGC CTC GTC CTC ACC 
Leu Ala Arg Arg Leu Ala Ala Glu Gly Ala Glu Arg Leu Val Leu Thr 
2215 22 20 2225 2230 
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AGC CGT CGC GGA CCG GAG GCG CCG GGC GCC GCC GAA CTC GCC GAG GAA 7553 
Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala Ala Glu Leu Ala Glu Glu 
2235 2240 2245 

CTG CGA GGA CAT GGC TCC GAG GTC GTG CAC GCG GCC TGT GAC GTC GCC 7601 
Lou Arg Gly His Gly Cys Glu Val Val His Ala Ala Cys Asp Val Ala 
2250 2255 2260 



10 GAG CGT GAT GCG CTC GCC GCG CTC GTC ACC GCG TAT CCG CCG AAC GCC 

Glu Arg Asp Ala Leu Ala Ala Leu Val Thr Ala Tyr Pro Pro Asn Ala 
2265 2270 2275 
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GTC TTC CAC ACC GCC GGG ATT CTG GAC GAC GCG GTG ATC GAC ACG CTC 7697 
Val Phe His Thr Ala Gly He Leu Asp Asp Ala Val He Asp Thr Leu 
2280 2285 2290 

TCA CCG GAG AGC TTC GAG ACC GTC CGC GGG GCG AAG GTC TGC GGC GCG 7745 
Ser Pro Glu Ser Phe Glu Thr Val Arg Gly Ala Lys Val Cys Gly Ala 
2295 2300 2305 2310 

GAG CTC CTG CAC CAA CTG ACT GCG GAC ATA AAA GGG TTC GAC GCC TTC 7793 
Glu Leu Leu His Gin Leu Thr Ala Asp lie Lys Gly Leu Asp Ala Phe 
2315 2320 2325 

GTC CTC TTC TCC TCC GTC ACC GGC ACA TOG GGC AAC GCC GGC CAG GGT 7841 
Val Leu Phe Ser Ser Val Thr Gly Thr Trp Gly Asn Ala Gly Gin Gly 
2330 2335 2340 

GCG TAC GCC GCC GCC AAC GCC GCG CTC GAC GCC CTC GCC GAG CGT CGC 7889 
Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala Glu Arg Arg 
2345 2350 2355 

CGT GCC GCC GGA CTC CCC GCG ACC TCC GTC GCC TCG GGC CTT TCG GGC 7937 
Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly Leu Trp Gly 
2360 2365 2370 

GGG GGA GGC ATC GOG GCG GGT GCG GGC GAG GAG AGT CTC TCG CGG CGA 7985 
Gly Gly Gly Met Ala Ala Gly Ala Gly Glu Glu Ser Leu Ser Arg Arg 
2375 2380 2385 2390 

GGG CTC CGG GCC ATC GAC CCC GAC GCG GCC GTC GAC GCG CTC CTC GGC 8033 
Gly Leu Arg Ala Met Asp Pro Asp Ala Ala Val Asp Ala Leu Leu Gly 
2395 2400 2405 

GCC ATC GGC AGG AAC GAC GTC TGC GTC ACT GTC GTC GAC GTC GAC TOG 8081 
Ala Met Gly Arg Asn Asp Val Cys Val Thr Val Val Asp Val Asp Trp 
2410 2415 2420 

GAG CGT TTC GCG CCC GCG ACG AAC GCC ATC CGT CCC GGG CGG CTC TTC 8129 
Glu Arg Phe Ala Pro Ala Thr Asn Ala He Arg Pro Gly Arg Leu Phe 
2425 2430 2435 

GAC ACC GTC CCG GAG GCG CGG GAG GCC CTG ACG GCA GCC GGC ACC ACG 8177 
Asp Thr Val Pro Glu Ala Arg Glu Ala Leu Thr Ala Ala Gly Thr Thr 
2440 2445 2450 

TCC GCG ACG CCG GAC GGC GCG CCC GAG CTG GCG CGG CGG TTC TCC ATC 8225 
Ser Ala Thr Pro Asp Gly Ala Pro Glu Leu Ala Arg Arg Leu Ser Met 
2455 2460 2465 2470 
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CTC AAC GAG ACC GAA CGC CTG COG AAG CTG GTC GAA CTC GTC CGT ACC 
Leu Asn Glu Thr Glu Arg Leu Arg Lys Leu Val Glu Leu Val Arg Thr 
2475 2480 2485 



8273 



GAG GCG GCC TTT GTG CTG CGG CAT CCG AAC ACG GAC GCC ATC GGC GCC 
Glu Ala Ala Phe Val Leu Arg His Pro Asn Thr Asp Ala He Cly Ala 
2490 2495 2500 



8321 



70 



GAA CGC CCG TTC AAG TCG GCC QGT TTC GAC TCC CTG ACC TCC CTG GAA 
Glu Arg Pro Phe Lys Ser Ala Gly Phe Asp Ser Leu Thr Ser Leu Glu 
2505 2510 2515 



8369 



15 



CTC CGC AAC CGC CTC AAT GCC GGC ACA GGC CTG AAG CTA CCC GCC ACC 
Leu Arg Asn Arg Leu Asn Ala Gly Thr Gly Leu Lys Leu Pro Ala Thr 
2520 2525 2530 



8417 



20 



GTC ATC TTC GAC CAC CCC AGC CCG ACC GCC CTG GCC AGA CTG CTG CTC 8465 
Val He Phe Asp His Pro Ser Pro Thr Ala Leu Ala Arg Leu Leu Leu 
2535 2540 2545 2550 

GAC CGG CTG ACC GGC GCC GGA GCC CCC GCG CCC GCC GCC GAT GAG CCG 8513 
Asp Arg Leu Thr Gly Ala Gly Ala Pro Ala Pro Ala Ala Asp Glu Pro 
2555 2560 2565 



25 



CCA CTG CCC GTC GCC GTG GCC GAC GAC GAC CCG GTG GTC ATC GTC GGC 
Pro Leu Pro Val Ala Val Ala Asp Asp Asp Pro Val Val He Val Gly 
2570 2575 2580 



8561 



30 



ATG GCG TGC CGT TTC CCC GGT GGG GCG GGC ACC CCT GAG GCG CTG TGG 
Met Ala Cys Arg Phe Pro Gly Gly Ala Gly Thr Pro Glu Ala Leu Trp 
2585 2590 2595 



8609 



35 



AAG CTG GTG ACC GAG GAG CGT GAC GTC ATA GGC GCC GCG CCC ACC GAC 8657 
Lys Leu Val Thr Glu Glu Arg Asp Val He Gly Ala Ala Pro Thr Asp 
2600 2605 2610 

CGG GGC TGG GAC CTG GAT TCC GTC TAC GAC CCG GAG CCG GGT GTG GCG 8705 
Arg Gly Trp Asp Leu Asp Ser Val Tyr Asp Pro Glu Pro Gly Val Ala 
2615 2620 2625 2630 



GGG AAG ACA TAT GTG CGG GAG GGG GGT TTT CTC CAC GAC GCG GCG GAG 
40 Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala Glu 

2635 2640 2645 



8753 



45 



SO 



TTC GAC GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG GCG GTG GCG ATG 8801 
Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Val Ala Met 
2650 2655 2660 

GAT CCG CAG CAG CGG CTG TTG CTG GAG ACC TCC TGG GAG GCG ATC GAG 8849 
Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala He Glu 
2665 2670 2675 



CGG GCG GGT ATC GAC CCG CAC TCG CTG CAC GGC AGC CGC ACC GGG GTA 
Arg Ala Gly lie Asp Pro His Ser Leu His Gly Ser Arg Thr Gly Val 
2680 2685 2690 



8897 



TAC GTC GGG CTG ACC CAT CAG GAG TAC GCC TCC CGG CTG CAC GAG GCC 
55 Tyr Val Gly Leu Thr His Gin Glu Tyr Ala Ser Arg Leu His Glu Ala 

2695 2700 2705 2710 



8945 
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CCG GAG GAG TAC GAA GGC TAT CTG CTC ACC GGC AAG TCG GCG AGC GTC 
Pro Glu Glu Tyr Glu Gly Tyr Leu Leu Thr Gly Lys Ser Ala Ser Val 
2715 2720 2725 

GTC TCC GGC CGC ATC TCG TAC ACG CTC GGG CTG GAG GGT CCT TCG CTC 
Val Ser Gly Arg lie Ser Tyr Thr Leu Gly Leu Glu Gly Pro Ser Leu 
2730 2735 2740 

TCC ATC GAC ACC GCG TGT TCG TCG TCG CTG GTC GCC CTG CAC AAC GCG 
Ser He Asp Tfcr Ala Cys Ser Ser Ser Leu Val Ala Leu His Asn Ala 
2745 2750 2755 

GCG CAG GCG TTG CGG GGT GGC GAG TGT GAC ATG GCG TTG GCC GGT GGT 
Ala Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala Leu Ala Gly Gly 
2760 2765 2770 

GTG ACG GTC ATG GCG GCA CCC GGA TTG TTC GTC GAG TTT TCG CGG CAG 
Val Thr Val Met Ala Ala Pro Gly Leu Phe Val Glu Phe Ser Arg Gin 
2775 2780 2785 2790 

CGG GGG TTG GCG GCC GAT GGG CGG TGC AAG GCG TTC GCG GAT GGG GCG 
Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Asp Gly Ala 
2795 2800 2805 

GAT GGG ACC GCT TOG GCC GAG GGT GCG GGG GTC GTC CTC GTC GAG CGG 
Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val Leu Val Glu Arg 
2810 2815 2820 

TTC TCG GAT GCC CGG CGG TTC GGG CAT CCG GTC TTC GCG GTC GTC TGT 
Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu Ala Val Val Cys 
2825 2830 2835 

GGG TCG GCG GTC AAT CAG GAC GGT GCG TCG AAT GGT TTC ACG GCG CCG 
Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
2840 2845 2 850 

ACT GGT CCG TCG CAG GAG CGG GTC ATT CGT CAG GCG TTG GCG AAT GCG 
Ser Gly Pro Ser Gin Glu Arg Val He Arg Gin Ala Leu Ala Asn Ala 
2855 2860 2865 2870 

CGG TTC ACG GTC GCG GAT GTC GAT GTC GTC GAG GCG CAT GGG ACG GGG 
Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly 
2875 2880 2 885 

ACG CGG CTC GGT GAT CCG ATC GAG GCG CAG GCG TTC CTC GGG ACG TAT 
Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Gly Thr Tyr 
2890 2895 2900 

GGC CGG GAT CGT GAT GCT GAG TCT CCG GTC TCG TTC GGG TCG TTC AAG 
Gly Arg Asp Arg Asp Ala Glu Cys Pro Val Trp Leu Gly Ser Leu Lys 
2905 2910 2915 

TCG AAT ATT GGT CAT GCT CAG GCG GCT GCG GGG GTC GCT GGT GTC ATC 
Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Ala Gly Val He 
29 20 2925 2930 

AAG ATC GTG TTG GOG ATG CGG TAT GGG TCG TTC CCG CGG ACG TTC CAT 
Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro Arg Thr Leu His 
2935 2940 2945 2950 
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GTG GAT GAG COG AGC CGG CAT GTG GAC TGO TCG GCT GGT GGT GTS COG 
Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala Gly Gly Val Arg 
2955 2960 2965 

TTG CTG ACC GAG GCG CGG GAG TGG CCG GGG GTG GAC CGG CCG CGT CGG 
Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp Arg Pro Arg Ara 
2970 2975 2980 

GCG GCG GTC TCC GCC TTC GGT GTC ACT GGT ACC AAC GCC CAT CTG ATC 
Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Leu lie 
2985 2990 2995 

CTC GAA GCC CCC GAA GCC CTC GAA GCC CTC GAA GCC ACC GAC GCC CCC 
Leu Glu Ala Pro Glu Ala Leu Glu Ala Leu Glu Ala Thr Asp Ala Pro 
3000 3005 3010 

GAA GCC CCC GAA GCC CCC GAA GCC CCC GAC GTC ACC GAC GTC ACC GAA 
Glu Ala Pro Glu Ala Pro Glu Ala Pro Asp Val Thr Asp Val Thr Glu 
3015 3020 3025 3030 

GCC CTC GAA GCC CCC GAC GCC ACC GAG GCG GAG GGT GCG AAG GCT CCT 
Ala Leu Glu Ala Pro Asp Ala Thr Glu Ala Glu Gly Ala Lys Ala Pro 
3035 3040 3045 

GGC ACT CCC GAA GAG CCA CAG CCT GCT GTG GGT GTG GTG CCG GTG GTG 
Cly Ser Pro Glu Glu Ala Gin Pro Ala Val Gly Val Val Pro Val Val 
3050 3055 3060 

GTT TCG GGG CGT TCG CCG GTG GTG GTG CGG GAG GCT GCG GGC CGG TTG 
Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu 
3065 3070 3075 

GCG GAG GTG GTG GAG GCC GGT GGT GTC GGG CTG GCG GAT GTG GCG GTG 

™ Val Val G1U Ala Gly Val Gl V Ala Asp Val Ala Val 
3080 3085 3090 

ACC ATG GCG GGC CGG TOG CGG TTT GGG TAT CGG GCG GTT GTC CTC GCT 
Thr Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala 
3095 3100 3105 3no 

Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly 
3115 3120 3125 

GAT CCG GAC GCG GGT CTG GTC ACC GGT GCG GTG GTG GAC CCG GAG ACG 
Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr 
3130 3135 3140 

GGG TCC GGT GGT GGG GGC GTG GTG TTG GTT TTC CCT GGT CAG GGG ACG 
Gly Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr 
31*5 3150 3155 

CAG TCG GTG GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT 

Tf?« V41 Gly Met Gly Ala Gly Leu ^ G1 V Ser Ser Glu Val Phe 
3160 3165 3170 

CCG GCG TCG ATG CGG GAG TCT GCG COG GCG CTG ACT GTT CAT GTG GAG 
Ala Ala ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Glu 
3175 3180 3i a5 319Q 
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TGG GAT TTG CTG GAG GTG GTG TCG GGC GGG GCC GGG TTG GAG CGG GTG 
Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val 
3195 3200 3205 

GAT GTG GTG CAG CCC GTG ACG TGG GCG GTG ATG GTG TCG CTG GCC CGG 
Asp Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg 
3210 3215 3220 

TAC TGG CAG GCG ATG GGT GTG GAC GTG GCT GCG CTG CTG GGT CAT TCC 
Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser 
3225 3230 3235 

CAG GGG GAG ATC GCT GCT GCC ACG GTG GCG GGG GCG TTG TCG CTG GAG 
Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu 
3240 3245 3250 

GAT GCG GCG GCT GTG GTC GCT CTG CGG GCG GGG TTG ATT GGC CGG TAT 
Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr 
3255 3260 3265 3270 

CTG GCG GGT CGT GGT GCG ATG GCG GCT GTT CCG CTG CCT GCC GGC GAG 
Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu 
3275 3280 3285 

GTC GAG GCC GGG CTG GCG AAG TGG CCC GGA GTA CAG GTA GCC GCG GTC 
Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Gin Val Ala Ala Val 
3290 3295 3300 

AAC GGT CCG GCG TCC ACG GTG GTT TCC GGG GAT CGG CGG GCG GTG GCC 
Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala 
3305 3310 3315 

GGT TAT GTG GCC GTC TGT CAG GCG GAG GGT GTG CAG GCT CGG TTG ATA 
Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He 
3320 3325 3330 

CCG GTG GAC TAC GCC TCT CAC TCC CGC CAT GTG GAG GAC CTG AAG GGC 
Pro Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly 
333 5 3340 3345 3350 

GAG TTG GAG CGG GTG CTG TCC GGT ATC CGC CCC CGC AGT CCG CGG GTG 
Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val 
3355 3360 3365 

CCG GTG TGT TCC ACC GTC GCC GGA GAG CAG CCG GGC GAG CCG GTT TTC 
Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe 
3370 3375 3380 

GAT GCG GGG TAT TCG TTC CGT AAT CTG CGG AAC CGG GTT GAG TTC TCC 
Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser 
3385 3390 3395 

GCG GTG GTC GGT GGT TTG TTG GAG CAG GGC CAC CGT CGG TTC ATC GAG 
Ala Val Val Gly Gly Leu Leu Glu Gin Gly His Arg Arg Phe He Glu 
3400 3405 3410 

GTC AGT GCC CAC CCG GTA CTC GTC CAT GCC ATT GAG CAG ACG GCC GAG 
Val Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu 
3415 3420 3425 3430 
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GCC GCG GAC CGG ACT GTC CAT GCC ACC GGA ACC CTG CGC CGC CAG GAC 11153 
Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp 
3435 3440 3445 

GAC AGC CCG CAC CGC CTG CTG ACC TCC ACC GCC GAG GCC TGG GCC CAC 11201 
Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His 
3450 3455 3460 

GGC GCC ACC CTC ACC TGG GAC CCC GCC CTG CCC CCA GGT CAC CTC ACC 11249 
Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr 
3465 3470 3475 

ACC CTC CCC ACC TAC CCC TTC AAC CAC CAC CAC TAC TGG GCC GTG ACA 11297 
Thr Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Ala Val Thr 
3480 3485 3490 
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TCC CCC GCC GGA GTC GGC GAC GCG GCT GCG GGC CGG TTC GGT ATG ACC 11345 
Ser Pro Ala Gly Val Gly Asp Ala Ala Ala Gly Arg Phe Gly Met Thr 
3495 3500 3505 3510 

TGG GAG GAC CAC CCC TTC CTC CGT GGC GGG TTA CCC CTG GCC GAC TCC 11393 
Trp Glu Asp His Pro Phe Leu Arg Gly Gly Leu Pro Leu Ala Asp Ser 
3515 3520 3525 

GGT GAG CGG GTG TTC GCC GGG CGG CTG GCG GGC TCC GAG CAC GAC TGG 11441 
Gly Glu Arg Val Phe Ala Gly Arg Leu Ala Gly Ser Glu His Asp Trp 
3530 3535 3540 

CTG ACG GAC CAT GCC CTG TCC GGG GTG ACG TTC CTG CCG GGT ACG GCC 11489 
Leu Thr Asp His Ala Val Ser Gly Val Thr Leu Leu Pro Gly Thr Ala 
3545 3550 3555 

TTC GTG GAG TTC GCG CTG CAC GCG GGA GCC GCC ACC GGC TGC GGG CGG 11537 
Phe Val Glu Phe Ala Leu His Ala Gly Ala Ala Thr Gly Cys Gly Arc 
3560 3565 3570 

CTG GAA GAG CTG AGC GTT GAG GCG CCG TTC CTC TTG CCC GCC GCC GGT 11585 
Leu Glu Glu Leu Ser Val Glu Ala Pro Leu Val Leu Pro Ala Ala Gly 
3575 3580 3585 3590 

GGT GTG CGG GTG CAG ATG AGG GTG TOG GCC GCC GAC GAG TCG GGA CGG 11633 
Gly Val Arg Val Gin Met Arg Val Ser Ala Ala Asp Glu Ser Gly Arg 
3595 3600 3605 

CGG AGG GTC GCC ATC CAC TCG GCC CCG GAA GCC GCC GTC CAC TCG GCC 11681 
Arg Arg Val Ala He His Ser Ala Pro Glu Ala Ala Val His Ser Ala 
3610 3615 3620 

GCA GAA GGC GGC GAC TCG GCC GGT GTC TGG ACG CGG CAC GGC GAG GGC 11729 
Ala Glu Gly Gly Asp Ser Ala Gly Val Trp Thr Arg His Gly Glu Gly 
3625 3630 3635 

ACG CTC GTG CCG GAC CCG GAG CCC ACG CCT CCG GAC GCC GAC TGG CCG 11777 
Thr Leu Val Pro Asp Pro Glu Pro Thr Pro Pro Asp Ala Asp Trp Ala 
3640 3645 3650 

CGG GCC TGG CCG CCC GCC GGG GAA CGC GTC GAA CCG GCC GAG CTC TAC 11825 
Arg Ala Trp Pro Pro Ala Gly Glu Arg Val Glu Pro Ala Glu Leu Tyr 
3655 3660 3665 3670 
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GAA CGG TTC GGG GCC CTG GGC TAC GAG TAC GGT GAG GCG TTC GCG GGC 
Glu Arg Phe Gly Ala Leu Gly Tyr Glu Tyr Gly Glu Ala Phe Ala Gly 
3675 3680 3685 

5 

GTG CGC GCC GTA TGG CGG CAG CCG GAC GCG CTG CTC GCC GAG GTG CTC 
Val Arg Ala Val Trp Arg Gin Pro Asp Ala Leu Leu Ala Glu Val Leu 
3690 3695 3700 

10 CTG CCC GAC CGG GCC TCG ACC GGT GCC GGC CGG TTC GGT GTG CAC CCC 

Leu Pro Asp Arg Ala Ser Thr Gly Ala Gly Arg Phe Gly Val His Pro 
3705 3710 3715 
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GCG CTG CTG GAC GCG GCG CTG CAG CCG TGG ATC GCC GGT GGT CTC CTC 
Ala Leu Leu Asp Ala Ala Leu Gin Pro Trp He Ala Gly Gly Leu Leu 
3720 3725 3730 

GAA GTG CCG GAG GAC GCA GTG CTG CTG CCC TTC GCC TGG CAG GGA GTG 12065 
Glu Val Pro Glu Asp Ala Val Leu Leu Pro Phe Ala Trp Gin Gly Val 
3735 3740 3745 3750 

TCG CTC TAC GCG ACG GGT GCC GGT GCT CTG CGG GTG CGG CTG ACG AAG 12113 
Ser Leu Tyr Ala Thr Gly Ala Gly Ala Leu Arg Val Arg Leu Thr Lys 
3755 3760 3765 

GCG GGT GAC GGG GCG GTC TCG CTC CAG GCC GCA GAC ACG TCC GGC GCG 12161 
Ala Gly Asp Gly Ala Val Ser Leu Gin Ala Ala Asp Thr Ser Gly Ala 
3770 3775 3780 

GCC GTG CTC TCC TTG GGG GCC CTG GTG ATG CGT CCG CTG GCG CGC CGG 12209 
Ala Val Leu Ser Leu Gly Ala Leu Val Met Arg Pro Leu Ala Arg Arg 
3785 3790 3795 

AAG CTG GAC GTG CTG CTC GGC ACG GAC GCC GGC GAA CGG TCG CTG TAC 12257 
Lys Leu Asp Val Leu Leu Gly Thr Asp Ala Gly Glu Arg Ser Leu Tyr 
3800 3805 3810 

CGC GTC GAG TGG CAG CCG CGG CTC CTG CCC GCC GGC CCG CCG CGC TCC 12305 
Arg Val Glu Trp Gin Pro Arg Leu Leu Pro Ala Gly Pro Pro Arg Ser 
3815 3820 3825 3830 

TGG GCG GTG CTC GGC CCC GAC GCG GAC CGG CTC GCC GGG ACG CCG GGC 12353 
Trp Ala Val Leu Gly Pro Asp Ala Asp Arg Leu Ala Gly Thr Pro Gly 
3835 3840 3845 

CTG GGG GAT CAG CCG GAC GGT GGG CCC ACC GCG CTG TAC CCG GAG GTG 12401 
Leu Gly Asp Gin Pro Asp Gly Gly Pro Thr Ala Leu Tyr Pro Glu Val 
3850 3855 3860 

CGG GCG CTG CGG AAG GCG CTG GCG GCG GGC GCG CCG CGG CCG GAA GCG 12449 
Arg Ala Leu Arg Lys Ala Leu Ala Ala Gly Ala Pro Arg Pro Glu Ala 
3865 3870 3875 

GTC GTA CTG CCG GTG CTC TCC GGG GCC GGG GCC ACT CCG GAG TCG GTG 12497 
Val Val Leu Pro Val Leu Ser Gly Ala Gly Ala Thr Pro Glu Ser Val 
3880 3885 3890 

CGG CAG ACA ACG GAG CGC TGT CTG ACC GCG CTC CAG GAC TGG CTG GAC 12545 
Arg Gin Thr Thr Glu Arg Cys Leu Thr Ala Leu Gin Asp Trp Leu Asp 
3895 3900 3905 3910 
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GCC GAG GAG TTC GTC GAC ACA CCG CTC ATA CTC CTC ACC AGG GGA GCC 
Ala Glu Glu Leu Val Asp Thr Pro Leu lie Val Leu Tfcr Arg Gly Ala 
3915 3920 3925 

?™ f: C <* C J™ CCG GGG GAG GAG ATC GGG GAC CTC GCG TCT GCG GGG 
Val Ala Ala Val Pro Gly Glu Glu lie Gly Asp Leu Ala Cya Ala Gly 
3930 3935 3 9 4o * 

CTC TCG GGC CTC GTC AGG TCC GCG CGG TCC GAG CAC CCG GGC CGC TTC 
Val Trp Gly Leu Val Arg Ser Ala Arg Ser Glu His Pro Gly Arg Phe 
3945 3950 3955 

GCC CTC GTC GAC ACC GAC GGG CAT CCG GAC GAC CGC ACC GCG CTC CCC 

™c« SP Thr ASP Gly His Pro ^9 Thr Ala Uu Pro 

3960 3965 3970 

CTC GCG CTC CGC GCG GTC CTC GAC GGC GCC GGC CAG CTC TCC CTG CGG 
Leu Ala Leu Arg Ala Val Leu Asp Gly Ala Gly Gin Leu Ser Leu Arg 
3975 39 *° 3985 3990 

?f GGC ACC GCC CGG ACC CCG GTC CTC CTC CGG GCC GGG ACC CCG GAG 
Ala Gly Thr Ala Arg Thr Pro Val Leu Leu Arg Ala Gly Thr Pro Glu 
3995 4000 4005 

GAG CAG CGG GOT CCG GCA TTC GAC CCC GCG GGC ACG GTC CTG GTC ACG 
Glu Gin Arg Gly Pro Ala Phe Asp Pro Ala Gly Thr Val Leu Val Thr 
4010 4015 4020 

^° *f C ^ ACG 010 008 000 010 °™ CGG CAT CTG GCC GCC 

Gly Ala Thr Gly Thr Leu Gly Arg Leu Leu Ala Arg Hia Leu Sa Sa 
402S 4030 4035 

GAG CAC GGT GTG CGC CAT CTG CTG CTG CTG AGC CGC GGC GGC CGG GCT 
Glu His Gly Val Arg His Leu Leu Leu Leu Ser Arg Gly SI 
* U4U 4045 4050 

Ala Glu Gly Ala A.p Glu Leu Ala Ala Glu Leu Ala Gly 2 SI £ 
4055 4060 4065 4070 

r^\ G '*T C GCG GCC ^C GCG GCG GAC CGC GAG GCC CTG GCA 

Glu Pro Cys Phe Ala Ala Cy. Asp Ala Ala Asp Arg Glu Ala Leu Ala 
4075 408O 408S 

CGG GTG CTG GCG GAG GTG CCG GCC GAC CGG CCG CTG ACC GGA GTG ATC 
Arg Val Leu Ala Glu Val Pro Ala Asp Arg Pro Leu Thr Gly Val 55 
4090 4095 4100 

CAC GCG GCC GGG GTG CTC GAC GAC GGC ACA CTC GAC GCG CTG ACC CCG 

1 0,° ly V<l1 ^ ** P ° ly ^ Asp Ala SutS S 

4105 4110 4U5 

GAA CGG ATC GGT ACC GTC ATG CGG CCG AAG GCG GAC GCC GCG CKS JLfcr 
Glu Arg tie Gly Thr Val Met Arg Pro Lys Ala aJ P £ £a 2 i£ 

* A ^ U 4125 4130 

CTG CAC GAA CTC ACC CGG ACC AGC CCG CTC TCG GTC TTC GCG GTC 
Leu hx. Glu Leu T*r Arg l*r Ser Pro Leu Ser vS K Sa S ™ 
4iJb 4140 4145 4150 
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TCG GGC GCT GCC GGC ATC CTG GGC CGC CCC GGA CAG GCC AAC TAC GCC 13313 
Ser Gly Ala Ala Gly lie Leu Gly Arg Pro Gly Gin Ala Asn Tyr Ala 
4155 4160 4165 

GCC GCC AAC ACC TTC CTC GAC GCG CTC GCG CAG CAC CGC CGC GCC CAC 13361 
Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala Gin His Arg Arg Ala His 
4170 4175 4180 

10 GGC CTC CCC GCC GTG TCG CTG GCC TGG GGG CTG TGG GGC GGG GCG ACC 13409 

Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Gly Gly Ala Thr 
4185 4190 4195 

GGC ATG ACC GGC CAT CTG TCC GGC ACC GAT CTG CGC CGG ATC CGC AGG 13457 
Gly Met Thr Gly His Leu Ser Gly Thr Asp Leu Arg Arg Met Arg Arg 
,5 4200 4205 4210 

TCC GGT ATC GCG CCG ATC ACC CAC GAC CAG GGG CTC GCC CTC TTC GAC 13505 
Ser Gly He Ala Pro Met Thr His Asp Gin Gly Leu Ala Leu Phe Asp 
4215 4220 4225 4230 

20 

CGA GCG CTC GCC GCC TCG GCC GAG GAC CCG CTC CTC GTA CCG ATC CGG 13553 
Arg Ala Leu Ala Ala Ser Ala Glu Asp Pro Leu Leu Val Pro Met Arg 
4235 4240 4245 

CTC GAC CTC GCC GCC CTC GTC CGG GAG CGG GCC GAG CAC GGG CCG GAC 13601 
Leu Asp Leu Ala Ala Leu Val Arg Glu Arg Ala Glu His Gly Pro Asp 
4250 4255 4260 

GCG GTG CCC GGA CCG CTC CTC GGG CTC CTC CCC GCC CGG GCC GCG GTC 13649 
Ala Val Pro Gly Pro Leu Leu Gly Leu Leu Pro Ala Arg Ala Ala Val 
30 4265 4270 4275 

CGG CAG GCG GCG GCA CCG GTA CCC GGC GGA GCC CCC GCC CCC GCC GGC 13697 
Arg Gin Ala Ala Ala Pro Val Arg Gly Gly Ala Pro Ala Pro Ala Glv 
4280 4285 4290 

35 

GGC GAG GGG ACG GCC GAG CGG CTC CCC GGG CTC GOG GAS GAG GCC AGG 13745 
Gly Glu Gly Thr Ala Glu Arg Leu Ala Gly Leu Gly Glu Glu Ala Arg 
«95 4300 4305 4310 

CTG CGC GAG CTG GTG AGG CTG GTC CGC GCC GAG GTG TCG GGC GTG CTG 13793 
Leu Arg Glu Leu Val Arg Leu Val Arg Ala Glu Val Ser Gly Val Leu 
♦315 4320 4325 

GGC TAC TCG GGC CCG GAC GCG GTG GAG CCC GGG CCC CCG TTC AAG GAT 13841 
Gly Tyr Ser Gly Pro Asp Ala Val Glu Pro Gly Arg Pro Phe Lys Asp 
4330 4335 4340 

CTC GGC TTC GAC TCG CTG ACC GCC GTG GAG CTG CGC AAC CGC CTC GGC 13889 
Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu Glv 
4345 4350 4355 

GCC GCC ACC GGG CTG CGG CTG CCG ACC GCG CTG GTC TTC GAC CGC CCG 13937 
Ala Ala Thr Gly Leu Arg Leu Pro Thr Ala Leu Val Phe Asp Ara Pro 
4360 4365 4370 

ACG TCC CAG GCA GTG GCC GAG TAC CTC GCT GCC CAA CTG GCC GGA CCG 13985 
55 Thr Ser Gin Ala Val Ala Glu Tyr Leu Ala Ala Glu Leu Ala Glv Pro 

4375 4380 4385 * 4390 
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CGG GAC GGC GGC GAC ACC GCG GCC GCC GCG TTC GAG GGC CTG GAG GCG 
Arg Asp Gly Gly Asp Thr Ala Ala Ala Ala Phe Glu Gly Leu Glu Ala 
4395 4400 4405 

CTG GCC GCG GCG GTG GGC GCG CTG GCC GAG GAC GAT CTG CGG CCC GAC 
Leu Ala Ala Ala Val Gly Ala Leu Ala Glu Asp Asp Leu Arg Arg Asp 
4410 4415 4420 

GTG CTC CGG CGG CGA CTG ACC GAA CTG GCC GCC GCC CTC ACC CCG CAG 
Val Leu Arg Arg Arg Leu Thr Glu Leu Ala Ala Ala Leu Thr Pro Gin 
4425 4430 4435 

GGC CGG AAC CCC TCC GCG CCC GCA CCC GCC CCG TCC GAT CTG GAC GAG 
Gly Arg Asn Pro Ser Ala Pro Ala Pro Ala Pro Ser Asp Leu Asp Glu 
4440 4445 4450 

CGG CTG GAC TCC GCG AAC GAC GAC GAC CTC TTC GCC TTC ATC GAG GAG 
Arg Leu Asp Ser Ala Asn Asp Asp Asp Leu Phe Ala Phe He Glu Glu 
4455 4460 4465 4470 

CAG CTT TCA GCAGCGAGAA CGACAGCAGC GAGAACGACG GCGACGACAC 
Gin Leu * 

CGCCGGGGCA GCTCCGGGGA CCGCTCCGGG GGCCTCCCCC CGGCAGGACG ACCGGGTCAG 

GGAGTATCTC AAGCGG GTG ACC GCC GAA CTG GTC GCC ACC CGC AAG CGG 
Met Thr Ala Glu Leu Val Ala Thr Arg Lys Arg 
1 5 io 

CTC GGC GCG CTG GAG GAG CGG GCC CGC GAA CCG ATC GCC GTC GTC GCG 
Leu Gly Ala Leu Glu Glu Arg Ala Arg Glu Pro He Ala Val Val Ala 
15 20 25 

ATG AGC TCC CGC TAC CCG GGC GGG GTG ACG ACC CCC GAG GAC CTG TCC 
Met Ser Cys Arg Tyr Pro Gly Gly Val Thr Thr Pro Glu Asp Leu Tro 
30 35 40 

CGG CTT CTC GCG GAC GAA CGC GAC GCC GTA TCC GGA CTT CCC CGG GAC 
Arg Leu Leu Ala Asp Glu Arg Asp Ala Val Ser Gly Leu Pro Arg Asp 
45 50 55 

CGC GGC TCG GAC CTG GAC GCC CTC TAC GAC CCC GAC GGC GGC CCC GGC 
Arg Gly Trp Asp Leu Asp Ala Leu Tyr Asp Pro Asp Gly Gly Pro Gly 
60 65 70 75 

ACC AGC TAC GCC CGC GAA GGC GGC TTC CTG AGC CAC TCC GCC GGA TTC 
Thr Ser Tyr Ala Arg Glu Gly Gly Phe Leu Ser His Cys Ala Gly Phe 
*° 85 90 

GAC GCG GAG TTC TTC GGC ATC TCC CCG CGC GAG GCG CTC GCG ATC GAC 
Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp 
55 100 105 

CCG CAG CAG CGG CTC CTC CTC GAG ACC TCC TOG GAG GCC CTC GAA CGC 
Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Ara 
"° 115 120 

GCC GGA GTC ACC GCC GAC CGC GCC CGG GGC AGC CGG ACG GGC GTC TAC 
Ala Gly Val Thr Ala Asp Arg Ala Arg Gly Ser Arg Thr Gly Val Tyr 
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125 130 135 

GCG GGC GTC ATG TAC GAC GAC TAC GGC GCC CGG GTC CTG TAC GGC GCC 14815 
Ala Gly Val Met Tyr Asp Asp Tyr Gly Ala Arg Val Leu Tyr Gly Ala 
140 145 150 155 

GGC GCC GGC CCG CCC GAG GAC CTG GAG GGT TAT CTC GTC AAC GGC AGC 14863 
Gly Ala Gly Pro Pro Glu Asp Leu Glu Gly Tyr Leu Val Asn Gly Ser 
160 165 170 

GCG GGC AGC ATC GCC TCC GGC CGT GTC TCC TAC ACG TTC GGG CTC CGC 14911 
Ala Gly Ser He Ala Ser Gly Arg Val Ser Tyr Thr Phe Gly Leu Arg 
175 180 185 

GGC CCC GCG GTC ACC GTC AAT ACG GCC TGT TCG TCG TCA CTG GTG TCG 14959 
Gly Pro Ala Val Thr Val Asn Thr Ala Cys Ser Ser Ser Leu Val Ser 
190 195 200 

CTC CAT CTG GCG GTG CGT GCC CTG CGG AAC GGC GAG TGC GAC ATG GCA 15007 
Leu His Leu Ala Val Arg Ala Leu Arg Asn Gly Glu Cys Asp Met Ala 
205 210 215 

CTG GCC GGC GGG GCG ACG GTG CTG TCC ACC CCC ACC GTG CTC GTG GAC 15055 
Leu Ala Gly Gly Ala Thr Val Leu Ser Thr Pro Thr Val Leu Val Asp 
220 225 230 235 

TTC TCC CGC CAG CGC GGT CTG GCC CCC GAC GGC CGC TGC AAG GCG TTC 15103 
Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe 
240 245 250 

GCC GAC TCC GCC GAC GGC ACC TCC TGG GCC GAG GGC GCC GGA ATG CTG 15151 
Ala Asp Ser Ala Asp Gly Thr Ser Trp Ala Glu Gly Ala Gly Met Leu 
255 260 265 

CTG CTC CAG CGG CTG TCC GAC GCC CGC CGC GAG GGG CGC CCC GTG CTG 15199 
Leu Leu Gin Arg Leu Ser Asp Ala Arg Arg Glu Gly Arg Pro Val Leu 
270 275 280 

GCC GTC ATT CGC GGC TCG GCC GTC AAC CAG GAC GGC GCC AGC AAC GGA 15247 
Ala Val 11m Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly 
285 290 295 

CTG ACC GCC CCC AAC GGG CGG GCG CAG CGG CAG GTC ATC GAG GAC GCG 15295 
Leu Thr Ala Pro Asn Gly Arg Ala Gin Arg Gin Val He Glu Asp Ala 
300 305 310 315 

CTG CGC GAC GCC GGG GTC GGC CCC GAC CAG GTG GAC GCG GTC GAG GCG 15343 
Leu Arg Asp Ala Gly Val Gly Pro Asp Gin Val Asp Ala Val Glu Ala 
320 325 330 

CAT GGC ACC GGT ACC GAG CTG GGC GAC CCC ATC GAG GCC GGG GCG CTC 15391 
His Gly Thr Gly Thr Glu Leu Gly Asp Pro He Glu Ala Gly Ala Leu 
335 340 345 

CTC GCC ACC TAT GGA ACG GCC CGT ACG GCG GAG CGC CCG CTC TGG CTC 15439 
Leu Ala Thr Tyr Gly Thr Ala Arg Thr Ala Glu Arg Pro Leu Trp Leu 
350 355 360 



55 000 TCC GTS AA° TCC AAC ATC GGG CAC ACC CAG GCC GCC GCC GGT GTT 

Gly Ser Leu Lys Ser Asn He Gly His Thr Gin Ala Ala Ala Gly Val 
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365 370 375 

GCG GGC GTC ATC AAG ATG GTG CTG GCG ATG CGG CAC GGC CGG CTG CCC 15535 
Ala Cly Val lie Lys Met Val Leu Ala Met Arg His Gly Arg Leu Pro 
380 385 390 395 

CCC ACC CTG CAC GTG GAC CGG CCC ACC ACC CGG GTG GAC TGG GAG AAG 15583 
Arg Thr Leu His Val Asp Arg Pro Thr Thr Arg Val Asp Trp Glu Lys 
400 405 410 

GGC GGG GTG CGG CTC CTC ACG GAG CCG GTG CCA TGG CCG GGG GAA GCG 15631 
Gly Gly Val Arg Leu Leu Thr Glu Pro Val Pro Trp Pro Gly Glu Ala 
415 420 425 

GGG GAG CCG CGT CGC GCG GGC GTG TCC TCC TTC GGC GCG AGC GGC ACG 15679 
Gly Glu Pro Arg Arg Ala Gly Val Ser Ser Phe Gly Ala Ser Gly Thr 
430 435 440 

AAC GCG CAT GTG GTG CTG GAG AGC GTC CCG GCC GGT GAA CCG CCC GCC 15727 
Asn Ala His Val Val Leu Glu Ser Val Pro Ala Gly Glu Pro Pro Ala 
445 450 



455 



GCC GGG CGG CCG GAG GAC ACA GGC GGC GCC TGG ACG GTC AGC GGC CGC 
Ala Gly Arg Pro Glu Asp Thr Gly Gly Ala Trp Thr Val Ser Gly Arg 
460 465 470 a 



15775 



475 



15823 



GGC CCG GCG GCC CTG CGC GCC CAG GCC GCC CGG CTG TAC GAC CCG CTC 
Gly Pro Ala Ala Leu Arg Ala Gin Ala Ala Arg Leu Tyr Asp Ala Leu 
480 485 490 

ACC GGC ACC GGC ACC GGC ACC GGA CAC GGC GCC GGA CAG GGC GCC GGA 15871 
30 Thr Gly Thr Gly Thr Gly Thr Gly Gin Gly Ala Gly Gin GW Ala Gly 

495 500 505 

^ C ^ ^ C ^ C ^ CCC GGC GCC CTG CCC CAC GCC CGT ACC CCC 

Pro Gly Thr Ala Glu Val Ala Gly Ala Leu Ala His Ala Arg Thr Ala 
35 510 515 520 

TOO3G«CC00aCTO<W3CTC«W«JAMCC(XWGAACTCCTC 15967 
Phe Arg Hi. Arg Ala Val Val Leu Gly Gly Asn Arg Ala Glu Leu Leu 

52S 530 535 



15919 



GCG OK CTG CGC GAG CTG GCG GAG GAG GAG CAT CCC GGA CCC CGC GTG 
AU Gly Leu Arg Glu Leu Ala Glu Glu Glu His Pro Gly Pro Arg Val 
b4 ° 545 550 555 



GTC ACA GGG ACC GCC CCG GCC ACC GAG CGG CGG ACG GCC TTC CTC TTC 
« Val Thr Gly Thr Ala Pro Ala Thr Glu Arg Arg Thr Ala Phe Leu Phe 

560 565 



570 



TCC GGG CAG GGC AGC CAG CGG GCC GGC TCC GGC CGG GGC CTG TAC CGC 
Ser Gly Gin Gly Ser Gin Arg Ala Gly Ser Gly Arg Gly Leu Tyr 2g 
5? 5 580 



585 



CGC CAC CCG GTC TTC GCC CGC GCC CTC GAC GAG GTG TGC GCC GCG CTC 
Arg His Pro Val Phe Ala Arg Ala Leu Asp Glu Val Cys Ala Ala Leu 
590 595 600 

55 1°° 5 AT f™ ^ CGC CCC CTC GAC CTC ATC TTC GCC GAG CCC 

Glu Pro His Leu His Arg Pro Leu Arg Asp Leu Met Phe Ala Glu Pro 
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ATC GCC GCG CTC CGC GCC ACC GCC GAG GAG ATC GCA CCA CTC CTC GAG 
Met Ala Ala Leu Arg Ala Thr Ala Glu Glu He Ala Pro Leu Leu Glu 
•' U0 7 °5 710 715 

CGC CGC GCG GGC GAA CTG GCG CTC GCC GCC GTC AAC GCC CCC TCC TCG 
Arg Arg Ala Gly Glu Leu Ala Leu Ala Ala Val Asn Gly Pro Ser Ser 
720 725 730 



TTC CAT TCA CCG CGG ATC GAC GGA ATG TTG ACA CAG TTC GAC CGG GTC 
Phe Hxs ser Pro Arg Met Asp Gly Met Leu Thr Gin Phe Asp Arg Val 

' 65 770 775 

OCT CGC ACC CTG ACQ TTC GCT CCG CCG ACC ATT CCC CTC GTC TCC ACC 
Alj Arg Thr Leu Thr Phe Ala Pro Pro Thr lie Pro Leu Val Ser Thr 



785 790 



795 



CTC ACC CCT ACC CCC GTC ACC GAG GAA ACC CTG TCC ACC GCG GAC CAC 
Leu Thr Gly Thr Pro Val Thr Glu Glu Thr Leu Cys Thr Ala £p 2S 
800 805 



810 



TGG GTC CGC CAG GCG CGC GAG CCG GTC CGC TTC CTG GAC GCC ATG CGG 
Trp Val Arg Gin Ala Arg Glu Pro Val Arg Phe Leu Asp £a £? aS 
815 820 



825 



ACC CTG CGC GCC GAC GGG ATC GAC ACC TTC GTC GAA CTC GGC CCC GAC 
Thr Leu Arg Ala Asp Gly He Asp Thr Phe Val G £ Leu Gly So Sp 
830 835 840 

55 Z£ v!T , CTC T C ^ ATC GCC 080 GAC TCC GCG GAC GAC CGG CCC GAT 

Gly Val Leu Ser Ala Met Ala Arg Asp Cys Ala Asp Asp Arg Pro Asp 
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605 610 615 

^ f° ff° ~? ^ °*° CCG ^ 0AC CGC ACC GAG TTC ACC CAG CCC 16255 
Cly Ser Pro Glu Ala Glu Pro Leu Asp Arg Thr Glu Phe Thr Gin Pro 

620 "5 630 635 

GCG CTC TTC GCG CTC CAG ACC GCC CTG TTC CGG CTG GCC GAG CAC CAC 
Ala Leu Phe Ala Leu Gin Thr Ala Leu Phe Arg Leu Ala Glu His His 
640 645 650 

GGG CTG CGC GCC GAG GCG CTG TCC GGG CAC AGC GTC GGC GAG ATC GCG 16351 
Gly Leu Arg Ala Glu Ala Leu Cys Gly His Ser Val Gly Glu lie Ala 
655 660 665 

GCC GCC CAT GCC GCC GGT GTC CTC ACC CTG CCC GAC GCG GCC CCT CTG 16399 
Ala Ala His Ala Ala Gly Val Leu Thr Leu Pro Asp Ala Ala Arg Leu 
670 675 680 

™ S ST f°° ?* ATG CAG 000 CCG GCC GGC GGT GCC 16447 

Val Ala Ala Arg Gly Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Ala 
20 685 690 695 



16495 



16543 



SaT S? vT? T G GAC GAG GCC GCC GTC CTG GAG CTA CTG GAG CAG 16591 

30 Val Val Val Ser Gly Asp Glu Ala Ala Val Leu Glu Leu Leu Glu Gin 

735 740 745 

TGG CGG GCC GAG GGC CGC GAG GCC AGG CGG CTC GCC GTC AGC CAT GCC 
Trp Arg Ala Glu Gly Arg Glu Ala Arg Arg Leu Ala Val Ser £i J 2 

35 ° 755 760 
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845 850 855 

GGC GAC ACA ACC GGG GCC GGG GAC GGG GAG ACC CCC GAT CCG CTG CTC 16975 
Gly Asp Thr Thr Gly Ala Gly Asp Gly Glu Thr Pro Asp Pro Lou Leu 
860 865 870 875 

ACC CTC CCG CTG CTG CGC CGC TCC GTG CCC GAG ACC GGC GAC GCC GAA 17023 
Thr Leu Pro Leu Leu Arg Arg Ser Val Pro Glu Thr Gly Asp Ala Glu 
880 885 890 

CAC CCC GGC GGC TTC GAA CGG GCC CTG GCC ACC GCC TAC GCA CAC GGC 17071 
His Pro Gly Gly Phe Glu Arg Ala Leu Ala Thr Ala Tyr Ala His Gly 
895 900 905 

GTC CCC CTG CGG CTC GCG CCC GCC CCC GAC GCC GCG TCC CTC GCC GTG 17119 
Val Pro Leu Arg Leu Ala Pro Ala Pro Asp Ala Ala Ser Leu Ala Val 
910 915 920 

GCC GCC GAA CTG CCC ACC TAC GCC TTC CAG CGC ACC CAC TAC TOG CTC 17167 
Ala Ala Glu Leu Pro Thr Tyr Ala Phe Gin Arg Thr His Tyr Trp Leu 
925 930 935 

GAC GCG CCC GCC GCC CCC GCC GCC CTC CCC GCC GGG CTC GAC GAC GCC 17215 
Asp Ala Pro Ala Ala Pro Ala Ala Leu Pro Ala Gly Leu Asp Asp Ala 
945 950 955 

GGT CAC CCG CTG CTC fCC GCG GCC CTC GAC CTG CCC GGC GGA CGC GGA 17263 
Gly His Pro Leu Leu Ser Ala Ala Leu Asp Leu Pro Gly Gly Arg Gly 
960 965 970 

ACG GTG TOG ACC GGA GCG CTC TCC GCG GCC ACC CTG CCC TOG GCC GCG 17311 
Thr Val Trp Thr Gly Ala Leu Ser Ala Ala Thr Leu Pro Trp Ala Ala 
975 980 985 

GAC CAC AGC GTG CAC GGC CGC ACC GTC CTG CCG GGC ACC GCC CTC CTC 17359 
Asp His Ser Val His Gly Arg Thr Val Leu Pro Gly Thr Ala Leu Leu 
990 995 1000 

GAC CTG GCG CTC CAC GCC GCC CCG CGC GTC GGC GAG TTG ACC TTC GAG 17407 
Asp Leu Ala Leu His Ala Ala Pro Arg Val Gly Glu Leu Thr Phe Glu 
1005 1010 1015 

GCG CCG CTG GTG CTG CCG GAG GAC GGA GAG GTC CGG CTC CGC GTC GTA 17455 
Ala Pro Leu Val Leu Pro Glu Asp Gly Glu Val Arg Leu Arg Val Val 
1020 1025 1030 1035 

CTC GCT GAA CCG GAC GCG AGC GGA GTA CGC GAA CTC TCT GTC CAC TCC 17503 
45 Leu Ala Glu Pro Asp Ala Ser Gly Val Arg Glu Leu Ser Val His Ser 

1040 1045 1050 

GCC GGC GAG GAC GGC GGC TGG ACC CGG CAC GCG ACA GCG GTC CTG GAC 17551 
Ala Gly Glu Asp Gly Gly Trp Thr Arg His Ala Thr Ala Val Leu Asp 
1055 1060 1065 

50 

ACC GGC ACC ACC ACC GCC GGG GAG CCC GCC GGC GCA CCG CCC GCC GCA 17599 
Thr Gly Thr Thr Thr Ala Gly Glu Pro Ala Gly Ala Pro Pro Ala Ala 
1070 1075 1080 

55 TGG CCG CCC GGG GAC GCC GAA CCC CTC GAC CTT GCC GCC GAG TAC GAG 17647 

Trp Pro Pro Gly Asp Ala Glu Pro Leu Asp Leu Ala Ala Glu Tyr Glu 
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1085 1090 1095 

CGC TTC GCC GAC GCC GGC ATC GGA TAC GGC CCC GCC TTC CGC GGA CTG 
Arg Phe Ala Asp Ala Gly He Gly Tyr Gly Pro Ala Phe Arg Gly Leu 
1100 H05 1110 1115 

CCC TCC GCC TGG CGC GAC GGC GAC GCG ATA CTG GCC GAC GTA CGG CTG 
Arg Ser Ala Trp Arg Asp Gly Asp Ala He Leu Ala Asp Val Arg Leu 
1120 1125 H30 

CCC GGC GAA CTG GCC GGC GAA GCC GAC CGG TAC GGC ATC CAC CCG GCC 
Pro Gly Glu Leu Ala Gly Glu Ala Asp Arg Tyr Gly He His Pro Ala 
1135 1140 H45 

CTG CTC GAC GCC GCC CTG CAC ACC GCG GCC GCC GCC CTG GGA GGG GCG 
Leu Leu Asp Ala Ala Leu His Thr Ala Ala Ala Ala Leu Gly Gly Ala 
1150 1155 H60 

CAC GGG ATG CTG CCG TTC ACG TGG AAC GGC GTC ACC CTC CAC GCC CGC 
His Gly Met Leu Pro Phe Thr Trp Asn Gly Val Thr Leu His Ala Arg 
1165 H70 H75 

GGA GCG CAC GCC ATC CGG GTG CGG CTG ACC CCG GCC GGC CCC GAC GCG 
Gly Ala His Ala He Arg Val Arg Leu Thr Pro Ala Gly Pro Asp Ala 
1180 H85 H90 H95 

GTC GCG GTC ACC GCC GTG GAC CCG GCG GGG CGC CCC GTG TTC ACG GCC 
Val Ala Val Thr Ala Val Asp Pro Ala Gly Arg Pro Val Phe Thr Ala 
1200 1205 1210 

GCC TCG CTC ACC CTG CGA CCG GTC ACG ACC GGG CAG CTG ACC GCG GCC 
Ala Ser Leu Thr Leu Arg Pro Val Thr Thr Gly Gin Leu Thr Ala Ala 
1215 1220 1225 

GAG GCC GCG CGG GCC CCG CTG TAC CGG GTG CGC TGG ACC GGC CTC CCG 
Glu Ala Ala Arg Ala Pro Leu Tyr Arg Val Arg Trp Thr Gly Leu Pro 
1230 1235 1240 

GAC ACC GGA ACC GCC CGG GAC CAC ACC TGG GCG GTG GCC GGC GGC CCG 
Asp Thr Gly Thr Ala Arg Asp His Thr Trp Ala Val Ala Gly Gly Pro 
1245 1250 1255 

GGC GAC CTG TTA CCC GGG GAG ACC CCG CAC CAC CCC GAC CTC GCC TCC 
Gly Asp Leu Leu Pro Gly Glu Thr Pro His His Pro Asp Leu Ala Ser 
1260 1265 1270 1275 

GCG CTC GCC GAC ACC GGC ACC GCC CCC TTC CGC GTA CTC GCG GAT CTC 
Ala Leu Ala Asp Thr Gly Thr Ala Pro Phe Arg Val Leu Ala Asp Leu 
1280 1285 1290 

CGC GGA TAC GGC ACC GCC ACC CCC CGG GAA CTC GCC TCC CAG GCG CTC 
Arg Gly Tyr Gly Thr Ala Thr Pro Arg Glu Leu Ala Ser Gin Ala Leu 
1295 1300 1305 

GCC CTC GTC CAG CAG TGG GCC GAC GCG GCC GAG GCC GCC GAA GGC AGG 
Ala Leu Val Gin Gin Trp Ala Asp Ala Ala Glu Ala Ala Glu Glv Aro 
1310 1315 1320 

CTC GTC CTG GTG ACA CCC CGG GCC GTC GAC ATC GGT GAC GGC GTC ACG 
Leu Val Leu Val Thr Arg Arg Ala Val Asp He Gly Asp Gly Val Thr 
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132S 1330 1335 

GAC CCG GCG GCG GCG ACC GTG TGG GGA CTG GTC CGG GCG GCA CAG TCC 
Aap Pro Ala Ala Ala Thr Val Trp Gly Leu Val Arg Ala Ala Gin Ser 
1340 1345 1350 135 5 

GAG CAC CCC CGG TGC TTC CCG CTC CTC GAC ACC GAC GAC TCC CCC CGC 
Glu His Pro Gly Cys Phe Ala Leu Leu Asp Thr Asp Asp Ser Pro Arg 
13«0 1365 1370 

TCC CGG CAA CTC CTC CCA CGC CTC GCG GGC ACC GCC GAG CAC CTC GCA 
Ser Arg Gin Leu Leu Pro Arg Val Ala Gly Thr Ala Glu Gin Leu Ala 
1375 1380 1385 

CTC CGC GAC GGC ACC CTG CTC GCC CCC TCC CTC ACC CGT GCC ACG CTG 
Leu Arg Asp Gly Thr Leu Leu Ala Pro Ser Leu Thr Arg Ala Thr Leu 
13 90 1395 140Q 

CCC GCC GGC GCC CCG CTG CCC GCA CTC GAC GGC ACC GTC CTG ATC ACT 
Pro Ala Gly Ala Arg Leu Pro Ala Leu Asp Gly Thr Val Leu He Thr 
1405 1410 1415 

GGG GGC ACC GGC AGC CTC GGC GCG GAG GCG GCC CGC CAT CTG GTC ACC 
Gly Gly Thr Gly Ser Leu Gly Ala Glu Ala Ala Arg His Leu Val Thr 
1420 1425 1430 1435 

CGG CAC GOT GCC CGG CGC CTG CTC CTG ACC AGC CGA AGC GGC CCG CAG 
Arg His Gly Ala Arg Arg Leu Leu Leu Thr Ser Arg Ser Gly Pro Gin 
1440 1445 1450 

GCC CCC GGC GCG GCC GAA CTC GTC GCC GAA CTG GCC GCC TTG GGC GCC 
Ala Pro Gly Ala Ala Glu Leu Val Ala Glu Leu Ala Ala Leu Gly Ala 
1*55 146O 1465 

CAC GCG GAC GTG GCC GCC TGC GAC GTC GCC GAC CGC GCC GCC CTG CGG 

^?« Val A1 * A1 * Val Ala **P **g Ala Ala Leu Arg 

1470 1475 1480 

GCC CTG CTC GAC CGC GTA CCC GCC GGC CAC CCG CTG ACC GCG GTC CTG 

P **» Val Pro A1 * Hi » p "> Thr Ala Val Leu 
1 * Bi 1490 1495 

CAC ACG GCG GGC GTC CTG GAC GAC GGC GTC CTC ACC GCC CAG ACA CCT 
His Thr Ala Gly Val Leu Asp Asp Gly Val Leu Thr Ala Gin Thr Pro 
1500 15 °S 1510 1515 

CAG CGG CTC GCG GCC GTC CTC CCC CCG AAG GCC GAC GCC GTA CGC AAT 
Gin Arg Leu Ala Ala Val Leu Arg Pro Lys Ala Asp Ala vll 5 i£ 
1520 1525 15 3o 

CTG CAC GAA CTC ACC CAC GGG CAC GCC CTC TCG GCG TTC ATC CTC TAC 
Leu His Glu Leu Thr Gin Gly His Ala Leu Ser Ala Jne iS SS S? 

1535 1540 1545 * 

TCG TCG GCC GCC GGA GTG CTC GGC AGC GCG GGC CAG AGC GGC TAC GCC 
Ser Ser MaAla Gly Val Leu Gly_Ser Ala Gly Gin s£ cTy £r Sa 

Til !^ f° !f I AC f TC CAC TCC TTC GCC GTC TGG CGG CGG AGC CGG 
Ala Ala Asn Ala Tyr Leu Asp Ser Phe Ala Val Trp Arg Arg Ser Arg 
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1565 1570 1575 

GGA CTC CCC GCC GTA TCG CTC GGC TGG GGC CCG TC3G GAC GGC GGC OGC 
Gly Leu Pro Ala Val Ser Leu Gly Trp Gly Pro Trp Asp Gly Gly Gly 
1580 1585 1590 1595 

ATC GCG AGC GGA CTC GGC GGA ACC GAC ACG GCC CGG CTC COG CGC AGC 
Met Ala Ser Gly Leu Gly Gly Thr Asp Thr Ala Arg Leu Arg Arg Ser 
I 600 1605 1610 

GGC ATC GCA CCC CTC AGC CGC GCC GAG GGC CTC GCC GCG CTC GAC GCG 
Gly He Ala Pro Leu Ser Arg Ala Glu Gly Leu Ala Ala Leu Asp Ala 
1 6 15 1620 1625 

GCG CTC GCG GCC GGC GGC GAC GAC ACC GCG CCG GCC CAC CTC CTC CCC 
Ala Leu Ala Ala Gly Gly Asp Asp Thr Ala Pro Ala His Leu Leu Pro 
1630 1635 1640 

ATC CGC GTC GAC GCG GTG ACC CTC CGC GGC GCC GAC ACC GTC CCC GCC 
He Arg Val Asp Ala Val Thr Leu Arg Gly Ala Asp Thr Val Pro Ala 
1*45 1650 1655 

GTC CTC CGC GAC CTC GCG GGA ACC GCG CCA AGC GCC GCC GAA CGG CCC 
Val Leu Arg Asp Leu Ala Gly Thr Ala Pro Ser Ala Ala Glu Arg Pro 
1660 166S i 6 70 9 i 67 5 

CCC GGG ACA CCG GAG GAC ACG AAC GCG CCC CTC GCG GAC GTC ACC CAA 
Pro Gly Thr Pro Glu Asp Thr Asn Ala Pro Leu Ala Asp Val Thr Gin 
I 680 1685 1690 

CTC CAC GGC CGG GAA CGG AAG GAG GCA CTC ACC GGC TTC GTA CGC GCC 
Leu His Gly Arg Glu Arg Lys Glu Ala Leu Thr Gly Phe Val Arg Ala 
1«95 1700 1705 

CAG GTC GCC GCG GTC CTC GGC CAC CCC ACG TCC GAC ACG ATC GAC GTC 
Gin Val Ala Ala Val Leu Gly His Pro Tfcr Ser Asp Thr He Asp Val 
mo 1715 1720 

CGC CGG AGC TTC AAG GAA GCG GGC TTC GAC TCC CTC ACC GCC GTC GAA 

5 ^?c Ser Phe LyS Glu Ma Gly Phe Ser «»r Ala Val Glu 
1725 1730 1735 

CTC CGC AAC CGG CTC CGC GCC GCC ACC GGG CTC AAG CTC CCC GCC ACG 
Leu Arg Asn Arg Leu Arg Ala Ala Thr Gly Leu Lys Leu Pro Ala Hut 
1740 1745 1750 1755 

CTC GTC TTC GAC CAC CCG ACC CCC CTC GCG CTC GCC GGC TTC CTC CAC 
Leu Val Phe Asp His Pro Thr Pro Leu Ala Leu Ala Gly Phe Leu His 
1760 1765 1770 

CGC GAA CTC CCC GGC GCC GAA GCC TCC CTC ATC AGC GCG ATC GAC ACT 
Arg Glu Leu Pro Gly Ala Glu Ala Ser Leu Met Ser Ala He Asp Thr 
I 775 1780 i7 8 5 

CTC CGG CAC CGG CTC CGC GAC GCC CTC GCC GAC GAC GCC GCA GAC GAC 
Leu Arg His Arg Leu Arg Asp Ala Leu Ala Asp Asp Ala Ala Asp Asp 
1790 1795 1800 

GCC CTC CGC GAC CAG ATC ACC CGA CGA CTC GAG ACC CTC CTC GCC GGC 
Ala Leu Arg Asp Gin He Thr Arg Arg Leu Glu Thr Leu Leu Ala Gly 
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ATA GCC CGG ACC GAG GAG CCC GCG CCC GCC ACC GCC GCC GCC GAC GAC 19855 
He Ala Arg Thr Glu Glu Pro Ala Pro Ala Thr Ala Ala Ala Asp Asp 
1820 1825 1830 1835 

GGG AGC GGA GCC GGT GAT GTC GCG GAA CGA CTG AGC ACC GCG TCG GAC 19903 
Gly Ser Gly Ala Gly Asp Val Ala Glu Arg Leu Ser Thr Ala Ser Asp 
1840 1845 1850 

GAC GAA CTG TTC GAA CTG CTC GAC AGC GGT TTC ACA CCC TGA CCGGCCGGCG 19955 
Asp Glu Leu Phe Glu Leu Leu Asp Ser Gly Phe Thr Pro * 

1855 i860 1865 

GACCGGCACC ACGACATCGA GGCGACCGCA CCGAACACCC CCAGGGGGAA AGCT 20009 

GTG TCC ACC GAG AAC TCC ACC AAC GTC CCG GCG AGC GAG 20048 
Met Ser Thr Glu Asn Ser Thr Asn Val Pro Ala Ser Glu 
1 5 io 

GAC AAG CTC CGC GCC TAT CTG CGT CGC GCG ATG GCC GAC CTC CAC GAG 20096 
Asp Lys Leu Arg Ala Tyr Leu Arg Arg Ala Met Ala Asp Leu His Glu 
15 20 25 

TCC CGC GAA CGG CTC CGC GCC ACG GAA GCG CGC GCC CAG GAG CCG ATC 20144 
Ser Arg Glu Arg Leu Arg Ala Thr Glu Ala Arg Ala Gin Glu Pro He 
30 35 40 45 

GCG GTG GTG GGT ATG GGG TGC CGG TTC CCC GGT GGG GTG GGT TCG CCG 20192 
Ala Val Val Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro 
50 55 60 

GAG GCG TTG TGG CGG TTG GTG GTG GAG GGG GTG GAC GCG GTT TCC CCG 20240 
Glu Ala Leu Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro 
65 70 75 

TTT CCC GGT GAT CGT GGC TGG GAT GTG GAG GGG TTG TAC GAC CCG GAG 20288 
Phe Pro Gly Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu 
80 85 90 

CCG GGT GTG GCG GGG AAG TCG TAT GTG CGG GAG GGG GGT TTT CTG CAT 20336 
Pro Gly Val Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu His 
95 100 105 

GAT GCG GCG GAG TTC GAT GCG GAG TTC TTC GGG ATT TCG CCG CGT GAG 20384 
Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu 
110 H5 120 125 

GCG GTG GCG ATG GAT CCG CAG CAG CGG CTG TTC CTG GAG ACC TCC TGG 20432 
Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp 
130 135 140 

GAG GCG ATC GAG CGG GCG GGT ATC GAC CCG CAT TCG CTG CAC GGC AGC 20480 
Glu Ala He Glu Arg Ala Gly He Asp Pro His Ser Leu His Gly Ser 
145 150 155 

CGC ACC GGC GTC TAC GCC GGC GTG ATG TAC CAC GAC TAT GGC ACG GGA 20528 
Arg Thr Gly Val Tyr Ala Gly Val Met Tyr His Asp Tyr Gly Thr Gly 
160 165 170 
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CAG ACC TCC GCG ACC GAC ACG AGC GGT TAT TCC GGC ACC GGT ACG TCG 20576 
Gin Thr Ser Ala Thr Asp Thr Ser Gly Tyr Ser Gly Thr Gly Thr Ser 
175 180 185 

GGG AGT GTG GTG TCG GGG CGT GTG GCC TAC ACG CTG GGG CTG GAG GGT 20624 
Gly Ser Val Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu Gly 
190 195 200 205 

CCG GCC GTG ACC GTG GAT ACG GCG TGT TCG TCG TCG TTG GTG GCG TTG 20672 
Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu 
210 215 220 

CAT CTG GCG GTG CAG GCG TTG CGG GGT GGC GAG TGT GAC ATG GCG TTG 20720 
His Leu Ala Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala Leu 
225 230 235 

GCC GGT GGT GTG ACG GTG ATG GCC GGG CCG GGG ATG TTC GTG GAG TTT 20768 
Ala Gly Gly Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu Phe 
240 245 250 

TCG CGG CAG CGG GGG TTG GCG GCC GAT GGG CGG TGC AAG GCG TTC GCG 20816 
Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala 
255 260 265 



GAT GGG GCG GAT GGG ACC GCT TGG GCC GAG GGT GCG GGG GTG GTG CTG 20864 
Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val Leu 
25 270 275 280 285 

GTG GAG CGG TTG TCG GAT GCC CGG CGG TTG GGG CAT CCG GTG TTG GCG 20912 
Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu Ala 
290 295 300 

30 

GTG GTG TGT GGG TCG GCG GTG AAT CAG GAC GGT GCG TCG AAT GGT TTG 20960 
Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu 
305 310 315 

ACG GCG CCG AGT GGT CCG TCG CAG GAG CGG GTG ATT CGT CAG GCG TTG 21008 
Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val He Arg Gin Ala Leu 
320 325 330 

GCG AAT GCG CGG TTG ACG GTG GCG GAT GTG GAT GTG GTG GAG GCG CAT 21056 
Ala Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala His 
40 335 340 345 

GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG GCG CAG GCG TTG CTG 21104 
Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu 
350 355 360 365 

45 

GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT CCG GTG TGG TTG GGG 21152 
Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu Gly 
370 375 380 

TCG TTG AAG TCG AAT ATT GGT CAT GCT CAG GCG GCT GCG GGG GTG GCT 21200 
Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Ala 
385 390 395 

GGT GTG ATC AAG ATG GTG TTG GCG ATG CGG TAT GGG TGG TTG CCG CGG 21248 
Gly Val lie Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro Arg 
55 400 405 410 
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CGC CAT GTG GAG GAC CTG AAG GGC GAG TTG GAG CGG GTG CTG TCC GGT 
Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly 
785 790 795 



55 



ACC GGG ACC CTG CGC CGC CAG GAC GAC AGC CCG CAC CGC CTG CTC ACC 
Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr 
880 885 890 



22064 



GCG GTG ATG GTG TCG CTG GCC CGG TAC TGG CAG GCG ATC GGT GTG GAC 22016 
Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp 
655 660 665 

GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG ATC GCT GCT GCC ACG 
Val Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr 
670 675 680 685 

GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT CTG 22112 
Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu 
690 695 700 

CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATG GCG 22160 
Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala 
705 710 715 

GCT GTT CCG CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG TGG 22208 
Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp 
720 725 730 

20 CCG GGT GTG GAG GTC GCG GCG GTC AAC GGT CCG GCG TCT ACG GTC GTT 22256 

Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val 
735 740 745 

TCC GGG GAT CGG CGG GCG GTG GCC GGT TAT GTC GCC GTC TCT CAG GCG 223 04 

25 nil Gly ASP Ar9 Arg Ala Val Ala G1 * 1 V r Val Ala Cys Gin Ala 

750 755 760 765 

GAG GGT GTG CAG GCT CGG TTG ATA CCG GTG GAC TAC GCC TCT CAC TCC 22352 
Glu Gly Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser 
770 775 780 



22400 



35 tT^ * C ° CGC AGT CCG CGG GTG CCG GTG TCT TCC ACC GTC GCC GGA 22448 

lie Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly 
800 805 810 

GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG TAT TCG TTC CGT AAT 
Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn 
40 815 820 825 

CTC CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTG TTG GAG 
Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu 
830 835 840 845 



22496 



22544 



22592 



GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC GTT 
Glu Gly His Arg Arg Phe lie Glu Val Ser Ala His Pro Val Leu Val 
850 855 860 

CAT GCC ATT GAG CAG ACG GCC GAG GCC GCG GAC CGG AGT GTC CAT GCC 22640 
His Ala lie Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val lis Sa 
8 65 870 875 
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TCC ACC CCC GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TCG GAC CCC 
Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro 
895 900 905 

GCC CTG CCC CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC AAC 
Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn 
910 915 920 925 

CAC CAC CAC TAC TGG CTC GAC ACC ACC CCC ACC ACC CCC GCG ACG ACC 
Has His His Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr Thr 
930 935 940 

ACC CAG AGC CCC ACC GAT GCC CAG AAC CCC GCC GAC GCC CTT CCC TAC 
Thr Gin Ser Pro Thr Asp Ala Gin Asn Pro Ala Asp Ala Leu Pro Tyr 
945 950 955 

AAG GTG AGT TGG AAG CGG TTG CGG GAC CAG GAC AGC TTG ACC GCG CGC 
Lys Val Ser Trp Lys Arg Leu Arg Asp Gin Asp Ser Leu Thr Ala Arq 
960 965 970 

CTC GAC GGC CGA TGG CTG CTG GTG GTA CCG GAG GCG TCG GCG GAC CCG 
Leu Asp Gly Arg Trp Leu Leu Val Val Pro Glu Ala Ser Ala Asp Pro 
97 5 980 985 

TCG GTT GCT GAG GGC GTC GCG CGC GAG CTG ACC GCG CGG GGC GCG ACC 
Ser Val Ala Glu Gly Val Ala Arg Glu Leu Thr Ala Arg Gly Ala Thr 
990 995 1000 1005 

GTG GAG TCG CTG ACG GTC GAG CCG GGC GCC GAC CGT TCG CGG CTG CGC 
Val Glu Ser Leu Thr Val Glu Pro Gly Ala Asp Arg Ser Arg Leu Arg 
1010 1015 1020 

GGG CTG CTG GTC GAC GCC ACG GAA CGG GAC GAA GCC GGG CCG CTG CGC 
Gly Leu Leu Val Asp Ala Thr Glu Arg Asp Glu Ala Gly Pro Leu Arg 
!025 1030 1035 

GGG ATC GTC TCG CTG CTG GCG TTG GCC GGG GAC CAC GCC GGG GCC GAC 
Gly lie Val Ser Leu Leu Ala Leu Ala Gly Asp His Ala Gly Ala Asp 
1040 1045 1050 

GGG GCA CGC CCG GTG GTT CCG GCC GGC CTG GCA GCG TCA CTG GCG CTC 
Gly Ala Arg Pro Val Val Pro Ala Gly Leu Ala Ala Ser Leu Ala Leu 
1055 1060 1065 

ATC CAG GCC GCG GGC GAC GCG GGC ACG GAG GCC GGG CTG TGG GCG GTC 
lie Gin Ala Ala Gly Asp Ala Gly Thr Glu Ala Gly Leu Trp Ala Val 
1070 1075 1080 1085 

ACC CGC GGC GCC GTC GCG GCC GTG CCC GGT GAC GTA CCG GCG CCG TCG 
Thr Arg Gly Ala Val Ala Ala Val Pro Gly Asp Val Pro Ala Pro Ser 
1Q 90 1095 noo 

CAG GCG CTG CTC TCG GGA TTC GGC CGG GTG GCC GGG ATC GAG CTG CCG 
Gin Ala Leu Leu Trp Gly Phe Gly Arg Val Ala Gly lie Glu Leu Pro 
1105 1110 ins 

CAC TCC TGG GGC GGC CTG CTC GAC CTG CCG ACC GGG CCC GGC GAC TCC 
His Cys Trp Gly Gly Leu Leu Asp Leu Pro Thr Gly Pro Gly Asp Ser 
1120 U25 H30 
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GGG TTC CGG CAA CTG GCC GCG ACC CTG GCG GGC CGT CCC GCG GAG GAC 23456 

Gly Phe Arg Gin Leu Ala Ala Thr Leu Ala Gly Arg Pro Ala Glu Asp 

1135 1140 1145 

CAG GTG GCG CTG CGG GCT TCG GGC GCG TAC GGC CGC AGG CTG GTC CGG 23504 
Gin Val Ala Leu Arg Ala Ser Gly Ala Tyr Gly Arg Arg Leu Val Arg 
1150 1155 1160 1165 

GCC TCC GCG GCG GGC GGC GCG GAC GGC TGG CGG CCG CGG GGA ACG GTG 23552 
Ala Ser Ala Ala Gly Gly Ala Asp Gly Trp Arg Pro Arg Gly Thr Val 
1170 1175 1180 

CTG GTG GTG GGC GAC ACC GCG GAG GTC GCC GGA CCG CTG GTG CGC TGG 23600 
Leu Val Val Gly Asp Thr Ala Glu Val Ala Gly Pro Leu Val Arg Trp 
1185 1190 1195 

CTG CTC GGC AAC GGC GCA CGG CGG GTG ACC CTG TCC GGA CTC TCC GGC 23648 
Leu Leu Gly Asn Gly Ala Arg Arg Val Thr Leu Ser Gly Leu Ser Gly 
1200 1205 1210 

CCG CTG CCG GAG GAA CTC GCC GAT GTG GCG GCA CGG GTG ACC GTG GCG 23 696 

Pro Leu Pro Glu Glu Leu Ala Asp Val Ala Ala Arg Val Thr Val Ala 
1215 1220 1225 



CCC TGT GAT CCG GCC GAT CGC CCC GCC CTG CGG ACG CTG CTC GCC GAA 23744 
Pro Cys Asp Pro Ala Asp Arg Pro Ala Leu Arg Thr Leu Leu Ala Glu 
25 1230 1235 1240 1245 

CAG GCG CCG ACC GCC GTG CTC GTG GCG CCC CCG GCC GTC CCG CCC ACG 23792 
Gin Ala Pro Thr Ala Val Leu Val Ala Pro Pro Ala Val Pro Pro Thr 
1250 1255 1260 

30 

CCG CTC GCG GAG ATG ACC GCC GAG GCG TTA GCC ATC GCG CTG TCC GCG 23840 
Pro Leu Ala Glu Met Thr Ala Glu Ala Leu Ala He Ala Leu Ser Ala 
1265 1270 1275 

AAG ACC GGT CTG GTC GAC CGT CTG GAC TCG CTG CTC GAC GAG CCG GAC 23 888 

Lys Thr Gly Leu Val Asp Arg Leu Asp Ser Leu Leu Asp Glu Pro Asp 
1280 1285 1290 

CCC CTG CTC GAG GAC GGG GAA CTC GAC GCG TTC GTC GTC TTC TCC TCC 23 93 6 

Pro Leu Leu Glu Asp Gly Glu Leu Asp Ala Phe Val Val Phe Ser Ser 
40 1295 1300 1305 

GTG GCA GGG GTG TGG GGC GGC GCG GGA CAG GGT GGT TAC GCG GCC GGT 23984 
Val Ala Gly Val Trp Gly Gly Ala Gly Gin Gly Gly Tyr Ala Ala Gly 
1310 1315 1320 1325 

45 

ACC GCG TAC CTC GAC GCG CTC GCC GAA TGC CGG CGG GCC GGG GGG CTG 24032 
Thr Ala Tyr Leu Asp Ala Leu Ala Glu Cys Arg Arg Ala Gly Gly Leu 
1330 1335 1340 

CCG GTC ACC TCG GTG GCG TGG ACG CCG TGG CTC GGT ACG CCG GCG GCG 24080 
Pro Val Thr Ser Val Ala Trp Thr Pro Trp Leu Gly Thr Pro Ala Ala 
1345 1350 1355 

GAC TCC CTG GGC GAG CAG ATG AGC CGA GCT GGC ATC ACC CCC CTG GAT 24128 
Asp Ser Leu Gly Glu Gin Met Ser Arg Ala Gly He Thr Pro Leu Asp 
55 1360 1365 1370 
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CCC GCG GCC TCG CTG GAT GCG CTC GCC CGT GCG GTG GGC CGG CGC GCG 
Pro Ala Ala Ser Leu Asp Ala Leu Ala Arg Ala Val Gly Arg Arg Ala 
1375 1380 1385 

GGC TGT GTG ACG GTC GCC GAC ATC GAC TGG GAG CGG TTC GCC TCC GCG 
Gly Cys Val Thr Val Ala Asp He Asp Trp Glu Arg Phe Ala Ser Ala 
1390 1395 1400 1405 

TAC ACG GCC ACC CGT CCC ACG CCG ATG TTC GAC GAG GTG CCC GAG GTG 
Tyr Thr Ala Thr Arg Pro Thr Pro Met Phe Asp Glu Val Pro Glu Val 
1410 1415 1420 

CGG CGG ATA CAG GCC GCG TGG GCG GAA GCG GAG GCC GAC GCC GCG CGC 
Arg Arg He Gin Ala Ala Trp Ala Glu Ala Glu Ala Asp Ala Ala Arg 
1425 1430 1435 

AGC GGT GCC GGC GGC GAC TCG CAG CTG CTG CGC TCC CTC CGG GGC CGG 
Ser Gly Ala Gly Gly Asp Ser Gin Leu Leu Arg Ser Leu Arg Gly Ara 
1440 1445 1450 

CCC GAG GAG GCC CAA CTG GCG GAG CTG CTG CGG CTG GTG CGC ACC CAT 
Pro Glu Glu Ala Gin Leu Ala Glu Leu Leu Arg Leu Val Arg Thr His 
1455 1460 1465 

GCC GCC GCG GTG CTC GGC CTG GGC TCG CCC GGC GCG GTG GAG GCG CGG 
Ala Ala Ala Val Leu Gly Leu Gly Ser Pro Gly Ala Val Glu Ala Ara 
1470 1475 1480 1485 

CGT TCG TTC AAG GAC CTG GGC TTC AAC TCG GTG ACG GCG GTG GAG CTG 
Arg Ser Phe Lys Asp Leu Gly Phe Asn Ser Val Thr Ala Val Glu Leu 
I 490 1495 1500 

CGG AAC CGG CTG AAG GAG GCG ACG GGA CTC CGG CTG GAG GTG TCC CTC 
Arg Asn Arg Leu Lys Glu Ala Thr Gly Leu Arg Leu Glu Val Ser Leu 
1505 1510 1515 

GTC TTC GAC CAC CCG GAC CCG GCC TCC CTC GCC CGG CAT CTG CTG GAT 
Val Phe Asp His Pro Asp Pro Ala Ser Leu Ala Arg His Leu Leu Asp 
1520 1525 1530 

CTC GCC CTC GGC CAG GAG CCG GAG GAG ACG CCG CGG GCG TTC GCG CTC 
Leu Ala Leu Gly Gin Glu Pro Glu Glu Thr Pro Arg Ala Phe Ala Leu 
1535 1540 1545 

GAA CCC GCG CCG AAC GGG GAG CCG ATC GCG ATC GTG TCC ATG GCC TGC 
Glu Pro Ala Pro Asn Gly Glu Pro lie Ala lie Val Ser Met Ala Cys 
1550 I 5 " 1560 is 65 

CGT ATG CCG GGG GGT GTC AGC ACG CCC GAG GAG CTG TGG CGG CTC CTC 
Arg Met Pro Gly Gly Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Leu 
I 570 1575 1580 

CGG GAC GGC AAG GAC GCG ATC GGG CCG TTC CCC GCC AAC CGG GGC TCG 
Arg Asp Gly Lys Asp Ala He Gly Pro Phe Pro Ala Asn Arg Gly Trp 
1585 1590 1595 

GAC CTC GAG AAC CTC TAC GAC CCC GAC CCG GAC GCC GAC GGC CGC ACC 
Asp Leu Glu Asn Leu Tyr Asp Pro Asp Pro Asp Ala Asp Gly Arg Thr 
I 600 1605 16 io 
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TAT GTG CGC GAG GGC GGA TTC CTC CAC GAG GCA CCG GAC TTC GAC CCC 
Tyr Val Arg Glu Gly Gly Phe Leu His Glu Ala Pro Asp Phe Asp Pro 
1615 1620 1625 

TCG TTC TTC GGC ATC TCG CCG CGC GAG GCG CTG GCG ATG GAC CCG CAG 
Ser Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin 
1630 1635 1640 1645 

CAG CGG CTG CTG CTG GAG ACT TCC TGG GAG GCC TTG GAG CGC GCC GGC 
Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly 
1650 1655 1660 

ATC GAC CCG GCG AGG CTG CGC GGC AGC CGT ACC GGC GTC TTC GTC GGG 
He Asp Pro Ala Arg Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly 
1665 1670 1675 

ACG AAC GGC CAG CAC TAC ATG CCG CTG CTC CAG AAC GGC GGG GAC AGC 
Thr Asn Gly Gin His Tyr Met Pro Leu Leu Gin Asn Gly Gly Asp Ser 
1680 1685 1690 

TTC GAC GGC TAT CTG GGC ACC GGC AAC TCG GCG AGT GTG ATG TCG GGC 
Phe Asp Gly Tyr Leu Gly Thr Gly Asn Ser Ala Ser Val Met Ser Gly 
1695 1700 1705 

CGG CTG TCG TAC GTG TTC GGC CTC GAA GGC CCC GCC GTG ACC GTG GAC 
Arg Leu Ser Tyr Val Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp 
1710 1715 1720 1725 

ACC GCC TGC TCG GCG TCC CTG GTG GCG CTG CAC CTG GCG GTG CAG GCG 
Thr Ala Cys Ser Ala Ser Leu Val Ala Leu His Leu Ala Val Gin Ala 
1730 1735 1740 

ATG CGG CGC GGC GAG TGC GAC ATG GCG CTG GTC GGC GGC GCG ACG GTC 
Met Arg Arg Gly Glu Cys Asp Met Ala Leu Val Gly Gly Ala Thr Val 
1745 1750 1755 

ATG TCG ACG CCC GAG ATG CTG GTG GAG TTC TCC CGG CAG CGG GTG ATC 
Met Ser Thr Pro Glu Met Leu Val Glu Phe Ser Arg Gin Arg Val He 
1760 1765 1770 

TCC GCC AAC GGC CGG TCG AGG GCC TTC GCC GCC GGT GCC GAC GGT GTG 
Ser Ala Asn Gly Arg Ser Arg Ala Phe Ala Ala Gly Ala Asp Gly Val 
1775 1780 1785 

GCG CTC GGC GAG GGC GTG GGC GTC CTG CTG GTG GAG CGG CTG TCG GAC 
Ala Leu Gly Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Aso 
1790 1795 1800 1805 

GCC GAG CGC AAC GGC CAT CCG GTG CTG GCG GTG GTC CGC GGC TCG GCG 
Ala Glu Arg Asn Gly His Pro Val Leu Ala Val Val Arg Gly Ser Ala 
ISIO 1815 1820 

GTC AAC CAG GAC GGC GCC TCC AAC GGG CTG ACG GCG CCC AAC GGG CCC 
Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro 
1825 1830 1835 

TCC CAG CAG CGG GTG ATC CGG CAG GCG CTG GCG GAC GCC GGG CTC CGG 
Ser Gin Gin Arg Val He Arg Gin Ala Leu Ala Asp Ala Gly Leu Ara 
1840 1845 1850 
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CCC GAG GAC ATC GAC GCC GTC GAG GCG CAC GGC ACC GGC ACC GAG CTG 
Pro Glu Asp He Asp Ala Val Glu Ala His Gly Thr Gly Thr Glu Leu 
1855 i860 1865 

GGC GAC CCC ATC GAG GCC GAG GCG CTG CTC GCC ACC TAT GGA AGG ACC 
Gly Asp Pro He Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Arg Thr 
1870 1875 1880 1885 

CGT ACG GCG GAC CGC CCG CTG TGG CTC GGC TCC CTC AAG TCC AAC ATC 
Arg Thr Ala Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn lie 
1890 1895 1900 

GGG CAC ACC CAG GCC GCC GCC GGC GTG GCG GGC GTC ATC AAG ATG GTC 
Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val 
1905 1910 1915 

CTC GCG CTC GGC AAC GAG ACA CTC CCG CGC ACC CTG CAC GTG GAT GAG 
Leu Ala Leu Gly Asn Glu Thr Leu Pro Arg Thr Leu His Val Asp Glu 
1920 1925 1930 

CCC ACA CCG CGC GTC GAC TGG TCC TCT GGC GCG GTC TCC CTG CTC ACC 
Pro Thr Pro Arg Val Asp Trp Ser Ser Gly Ala Val Ser Leu Leu Thr 
1935 1940 1945 

GAG CCG GTC GAC TGG CCC GCC GGC CCG TCC GCG CCG CGC CGT GCG GCC 
Glu Pro Val Asp Trp Pro Ala Gly Pro Ser Ala Pro Arg Arg Ala Ala 
1950 1955 i960 1965 

GTC TCC TCG TTC GGC ATC AGC GGC ACC AAC GCC CAC ACG ATC CTC GAG 
Val Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Thr He Leu Glu 
1970 1975 1980 

CAG GCC CCC GTC CCC GCG GAG TCC CGC CCC GGG ACG GAG CCG GCG GAC 
Gin Ala Pro Val Pro Ala Glu Ser Arg Pro Gly Thr Glu Pro Ala Asp 
1985 1990 1995 

GGC ACG GGC GCG TCG GAG AAC GTG ACC GTT CCG CTC CTC CTC TCC GGC 
Gly Thr Gly Ala Trp Glu Asn Val Thr Val Pro Leu Leu Leu Ser Gly 
2000 2005 2010 

CAC ACC GAG GCG GCG CTC CGC GAG CAG AGC ACG AGG CTG CTG AAC GAC 
His Thr Glu Ala Ala Leu Arg Glu Gin Ser Thr Arg Leu Leu Asn Asp 
2015 2020 2025 

CTC CTG GAG CAC CCG GAC GAG CAC CCG GCC GAC GTC GGC TAC ACC CTG 
Leu Leu Glu His Pro Asp Glu His Pro Ala Asp Val Gly Tyr Thr Leu 
2030 2035 2040 2045 

ATC ACC GGC AGG GCC CAC TTC GGG CAC CGG GCC GCC GTG ATC GGC GAG 
He Thr Gly Arg Ala His Phe Gly His Arg Ala Ala Val lie Gly Glu 
2050 2055 2060 

AGC CGG GAA GAA CTG CTC GAC GCC CTG AAG GCT CTC GCC GAG GGC CGC 
Ser Arg Glu Glu Leu Leu Asp Ala Leu Lys Ala Leu Ala Glu Gly Arg 
2065 2070 2075 

GAG CAC CAC ACC GTG GTA CGG GGC GAC GGG ACG GCC CAC CCG GAC CGG 
Glu His His Thr Val Val Arg Gly Asp Gly Thr Ala His Pro Asp Arg 
2080 2085 2090 
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CGC GTG GTC TTC GTC TTC CCC GGG CAG GGC TCG CAG TGG CCG TCG ATG 
Arg Val Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Pro Ser Met 
2095 2100 2105 



26336 



GCC CGG GAC CTG CTC GAC CGC GCG CCC GCC TTC CGC GAG ACG GCG AAG 
Ala Arg Asp Leu Leu Asp Arg Ala Pro Ala Phe Arg Glu Thr Ala Lys 
2110 2115 2120 2125 



26384 



10 



GCC TGC GAC GCC GCG CTG AGC GTC CAT CTG GAC TGG TCC GTG CTC GAT 
Ala Cys Asp Ala Ala Leu Ser Val His Leu Asp Trp Ser Val Leu Asp 
2130 2135 2140 



26432 



15 



GTC CTC CAG GAG AAG CCG GAC GCG CCG CCG CTG AGC CGG GTC GAC GTG 26480 
Val Leu Gin Glu Lys Pro Asp Ala Pro Pro Leu Ser Arg Val Asp Val 
2145 2150 2155 

GTG CAG CCC GTG CTG TTC ACG ATG ATG CTG TCG CTC GCC GCC TGC TGG 26528 
Val Gin Pro Val Leu Phe Thr Met Met Leu Ser Leu Ala Ala Cys Trp 
2160 2165 2170 



20 CGG GAC CTC GGC GTC CAC CCG GCC GCC GTG GTG GGC CAC TCC CAG GGA 

Arg Asp Leu Gly Val His Pro Ala Ala Val Val Gly His Ser Gin Gly 
2175 2180 2185 



26576 



25 



GAG ATC GCG GCG GCC TGC GTG GCC GGC GCG CTC TCC CTG GAG GAC GCG 
Glu He Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala 
2190 2195 2200 2205 



26624 



30 



GCG CGG ATC GTG GCG CTG CGC AGC CGG GCA TGG CTC ACA CTG GCC GGC 26672 

Ala Arg He Val Ala Leu Arg Ser Arg Ala Trp Leu Thr Leu Ala Gly 
2210 2215 2220 

AAG GGC GGC ATG GCC GCC GTC TCC CTG CCG GAA GCC CGG CTG CGC GAG 26720 

Lys Gly Gly Met Ala Ala Val Ser Leu Pro Glu Ala Arg Leu Arg Glu 

2225 2230 2235 



35 



CGG ATC GAG CGG TTC GGG CAG CGG CTG TCG GTG GCC GCG GTG AAC AGC 
Arg He Glu Arg Phe Gly Gin Arg Leu Ser Val Ala Ala Val Asn Ser 
2240 2245 2250 



26768 



40 



CCG GGC ACG GCG GCG GTC GCC GGT GAC GTG GAC GCG CTG CGG GAA CTG 
Pro Gly Thr Ala Ala Val Ala Gly Asp Val Asp Ala Leu Arg Glu Leu 
2255 2260 2265 



26816 



45 



CTG GCG GAG CTG ACC GCG GAG GGC ATC CGG GCC AAG CCG ATC CCC GGC 26864 
Leu Ala Glu Leu Thr Ala Glu Gly He Arg Ala Lys Pro He Pro Gly 
2270 2275 2280 2285 

GTG GAC ACG GCC GGC CAC TCC GCG CAG GTG GAC GGC CTG AAG GAG CAT 26912 
Val Asp Thr Ala Gly His Ser Ala Gin Val Asp Gly Leu Lys Glu His 
2290 2295 2300 



50 



CTC TTC GAG GTG CTG GCG CCG GTC TCC CCG CGC TCC TCG GAC ATC CCG 
Leu Phe Glu Val Leu Ala Pro Val Ser Pro Arg Ser Ser Asp He Pro 
2305 2310 2315 



26960 



55 



TTC TAC TCG ACG GTG ACG GGC GCG CCG CTG GAC ACC GAG CGG CTG GAC 
Phe Tyr Ser Thr Val Thr Gly Ala Pro Leu Asp Thr Glu Arg Leu Asp 
2320 2325 2330 



27008 
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GCC GGG TAC TGG TAC CGC AAC ATG CGG GAG CCC GTG GAG TTC GAG AAG 
Ala Gly Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe Glu Lys 
2335 2340 2345 

GCC GTC AGG GCA CTG ATC GCC GAC GGC TAC GAC CTG TTC CTG GAG TGC 
Ala Val Arg Ala Leu He Ala Asp Gly Tyr Asp Leu Phe Leu Glu Cys 
2350 2355 2360 2365 

AAC CCG CAC CCG ATG CTC GCC ATG TCG CTG GAC GAG ACA CTC ACC GAC 
Asn Pro His Pro Met Leu Ala Met Ser Leu Asp Glu Thr Leu Thr Asp 
2370 2375 2380 

AGC GGC GGC CAC GGC ACC GTG ATG CAC ACC CTC CGC CGG CAG AAG GGC 
Ser Gly Gly His Gly Thr Val Met His Thr Leu Arg Arg Gin Lys Gly 
2385 2390 2395 

AGC GCC AAG GAC TTC GGC ATG GCG CTC TGC CTC GCC TAT GTC AAC GGA 
Ser Ala Lys Asp Phe Gly Met Ala Leu Cys Leu Ala Tyr Val Asn Gly 
2400 2405 2410 

CTG GAG ATC GAC GGA GAA GCC CTC TTC GGC CCC GAC TCA CGC CGG GTG 
Leu Glu He Asp Gly Glu Ala Leu Phe Gly Pro Asp Ser Arg Arg Val 
2415 2420 2425 

AAC CCG CCG ACG TAC CCG TTC CAG CGG GAG CGC TAC TGG TAC CAC CCC 
Asn Pro Pro Thr Tyr Pro Phe Gin Arg Glu Arg Tyr Trp Tyr His Pro 
2430 2435 2440 2 445 

ACG AGC GGC AGG CGC GGC GAC ATC ACG GCG GCC GGC GTG GCC GAG GCG 
Thr Ser Gly Arg Arg Gly Asp He Thr Ala Ala Gly Val Ala Glu Ala 
2450 2455 2460 

GAG CAC CCG CTG CTC GGC GCC GGC GTC GAA CTC CCG GAG ACC GGG GGC 
Glu His Pro Leu Leu Gly Ala Gly Val Glu Leu Pro Glu Thr Gly Gly 
2465 2470 2475 

ACG GTG TAC ACC GCG CGG TTC GGC CCG GAC AGC CGG CCG TGG CTG GCC 
Thr Val Tyr Thr Ala Arg Phe Gly Pro Asp Ser Arg Pro Trp Leu Ala 
2480 2485 2490 

GAC CAC GCG CTG CTG GGC ACC GTG CTG CTG CCC GGC ACG GCA ATC CTG 

ASP ™ A1 * LeU LeU Gly Thr Val Leu Leu Pro Thr Ala He Leu 
2495 2500 2505 

GAC CTG GTG CTG TGG GCG GGC GAA CGC TCC GGC TGC GGC CGC GTC GGT 
Asp Leu Val Leu Trp Ala Gly Glu Arg Ser Gly Cys Gly Arg Val Gly 
2510 2515 2520 2525 

GAA CTG GCG CTC CAG GCA CCG CTG GTC CTG CCG GAC AGC GGC GAC GTC 
Glu Leu Ala Leu Gin Ala Pro Leu Val Leu Pro Asp Ser Gly Asp Val 
2530 2535 2540 

GAA CTG CGG CTG CTG GTC GGC GGC CCG GAC GAG GAG AAA CGG CGC ACC 
Glu Leu Arg Leu Leu Val Gly Gly Pro Asp Glu Glu Lys Arg Arg Thr 
2545 2550 2 555 

£u C CAC GCG CGG CCC GCG GCG GCC «C GCC GAG GCG CCG TGG 

Val Thr Val His Ala Arg Pro Ala Ala Ala Gly Ala Glu Ala Pro Trp 
2560 2565 2 570 
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ACC CGG CAC GCC GAA GCC GTG GTG CTG CCC GCC ACC GGC GAG GAG CCG 
Thr Arg His Ala Glu Ala Val Val Leu Pro Ala Thr Gly Glu Glu Pro 
2575 2580 2585 

ACC CCC GCC CCG CGC CCC GTC CCC GAG CCG GCG GGC ACC ACG GAC CCC 
Thr Pro Ala Pro Arg Pro Val Pro Glu Pro Ala Gly Thr Thr Asp Pro 
2590 2595 2600 2605 

GCC GCG TTC TAC GCG GAG TTC GCC GAG CGC GGC TAC GAC TAC GGC CCG 
Ala Ala Phe Tyr Ala Glu Phe Ala Glu Arg Gly Tyr Asp Tyr Gly Pro 
2610 2615 2620 

GCC TTC CAG GGC TTC ACC GCC GGA GCG CGC CAC GGC GAG GAC GTC GTC 
Ala Phe Gin Gly Phe Thr Ala Gly Ala Arg His Gly Glu Asp Val Val 
2625 2630 2635 

GCC GAG GTG GCG CTG CCC AGC GGC CTG GTG GCG GAC GCC CGT CAC CAC 
Ala Glu Val Ala Leu Pro Ser Gly Leu Val Ala Asp Ala Arg His His 
2640 2645 2650 

CGG CTG CAC CCG GCG CTG CTC GAC GCC GCG CTC CAG GCG ATG ATC CTC 
Arg Leu His Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Met He Leu 
2655 2660 2665 

GGC ACG TTC TTT GCC GAC GAC GGC CGC GCC CGG ATG CCG TTC GCG GTC 
Gly Thr Phe Phe Ala Asp Asp Gly Arg Ala Arg Met Pro Phe Ala Val 
2670 2675 2680 2685 

CGC GGA GTA CGG CTG CAC ACG GCC GGC GCC GAC CGG CTG CGC GTC CTG 
Arg Gly Val Arg Leu His Thr Ala Gly Ala Asp Arg Leu Arg Val Leu 
2690 2695 2700 

ATC TCC CCG GCG GGC GAC GAG ACC GTA CGG CTG CTC TGC ACC GAC CTC 
He Ser Pro Ala Gly Asp Glu Thr Val Arg Leu Leu Cys Thr Asp Leu 
2705 2710 2715 

GCG ACC GGC GCC CCC GTG CTG GAG ATC GAC GAA CTG GTC GTC CGC CCG 
Ala Thr Gly Ala Pro Val Leu Glu He Asp Glu Leu Val Val Arg Pro 
2720 2725 2730 

GTG TCC GGC GAG CAG TTG GCG GCC GGC GCC CCG GGC CGC AAC GGC GGC 
Val Ser Gly Glu Gin Leu Ala Ala Gly Ala Pro Gly Arg Asn Gly Glv 
2735 2740 2745 

GAG CTG TAC CGG GTC GAC TGG ACG GTG CTG CCG GAG CCC GCC GAG GTG 
Glu Leu Tyr Arg Val Asp Trp Thr Val Leu Pro Glu Pro Ala Glu Val 
2750 2755 2760 2765 

CCC GCG CCG CGC TGG GCC CTC CTC GGC GAG GAC CAC GCC GGC CTG GCC 
Pro Ala Pro Arg Trp Ala Leu Leu Gly Glu Asp His Ala Gly Leu Ala 
2770 2775 2780 

GAT GTG CTC GGA GGG ACG GGC GGC GGC TGC GAG CGG TAC GAC ACC CTC 
Asp Val Leu Gly Gly Thr Gly Gly Gly Cys Glu Arg Tyr Asp Thr Leu 
2785 2790 2795 

ACC GGC CTG CTG GAG GCC ACC ACC CGG TCG GCC GGC GGA ATC CTG CCC 
Thr Gly Leu Leu Glu Ala Thr Thr Arg Ser Ala Gly Gly lie Leu Pro 
2800 2805 2810 
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GAC ATC GTC GCG CTC TCC TTG CCC ACC GCC CCG GAG CCC GGC CCC CAG 
Asp He Val Ala Leu Ser Leu Pro Thr Ala Pro Glu Pro Gly Pro Gin 
2815 2820 2825 

GCG GTG CGC GAG GTG CTG TCC CAG GCG CTC GAC GCC GCC CAG GCG TCG 
Ala Val Arg Glu Val Leu Ser Gin Ala Leu Asp Ala Ala Gin Ala Trp 
2830 2835 2840 2845 

CTG GCC GCC GGC GCC GAG ACC GCC TCC GCC CGG CTG GTG TTC GTC ACC 
Leu Ala Ala Gly Ala Glu Thr Ala Ser Ala Arg Leu Val Phe Val Thr 
2850 2855 2860 

GGC GGC GCG GTG GCC ACC ACG GCG GAC GAA ACC GTG CGC GAC ATC GCG 
Gly Gly Ala Val Ala Thr Thr Ala Asp Glu Thr Val Arg Asp He Ala 
2865 2870 2875 

GCG GCC GCC GTC TGG GGC CTG GTC CGC TCG GCG CAG TCC GAG GAA CCC 
Ala Ala Ala Val Trp Gly Leu Val Arg Ser Ala Gin Ser Glu Glu Pro 
2880 2885 2890 

GAC CGC ATG GTC CTG CTC GAC CTG GAC GGC GAG CGG CCC ACC GCG CGG 
Asp Arg Met Val Leu Leu Asp Leu Asp Gly Glu Arg Pro Thr Ala Arg 
2895 2900 2905 

ACG CTG GCG GCG GCG CTC GCG TCC GGC GAA CCG CAA CTC GCC GTG CGC 
Thr Leu Ala Ala Ala Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg 
2910 2915 2920 2925 

GGC TCC ACG GTG GCC GCT CCC CGG CTG GCC CCG GCC GGG CCC GGC CCG 
Gly Ser Thr Val Ala Ala Pro Arg Leu Ala Pro Ala Gly Pro Gly Pro 
293 ° 2935 2940 

GAG GAC CTC GTA CCG CCC GCC GGC ACC ACC GCC TGG CGG CTC ACC CCC 
Glu Asp Leu Val Pro Pro Ala Gly Thr Thr Ala Trp Arg Leu Thr Pro 
2945 2950 2955 

GGC GGG GGG ACG CTG GAG GAA CTG TCG CTC GCG CCC GCC CCC GAC GCG 
Gly Gly Gly Thr Leu Glu Glu Leu Ser Leu Ala Pro Ala Pro Asp Ala 
2960 2965 2970 

GAG GAA CCA CTG GCA CCG GGC CAG GTA CGC ATC GCC GTC CGC GCG GCG 

^ c Pr ° LeU Ala Pr ° Gly Gln Val Ar $ Ile A1 * Val Arg Ala Ala 
2975 2980 2985 

GGC GTG AAC TTC CGC GAC GCC CTG ATC GCC CTC GGC ATG TAC CCG GGC 
Gly Val Asn Phe Arg Asp Ala Leu Ile Ala Leu Gly Met Tyr Pro Gly 
2990 2995 3000 3005 

AAG GGA ACC ATG GGC GCC GAG GGA GCC GGC GTC GTC GTC GAG ACC GCC 
Lys Gly Thr Met Gly Ala Glu Gly Ala Gly Val Val Val Glu Thr Ala 
3010 3015 3020 

CCC GAT GTC ACC GGC CTC TCC GCC GGA GAC CGC GTG CTC GGC ATC TCG 
Pro Asp Val Thr Gly Leu Ser Ala Gly Asp Arg Val Leu Gly Met Trp 
3 °25 3030 3035 

AAC GGC GGC TTC GGG CCC CTC GTG GTG GCC GAC CAC CGC ATG GTG GCC 
Asn Gly Gly Phe Gly Pro Leu Val Val Ala Asp His Arg Met Val Ala 
3040 3045 3050 
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CCG ATC CCC CAC GGC TCG TCG TAC GCC GAG GCG GCC TCC GTG CCC GCC 29216 
Pro He Pro His Gly Trp Ser Tyr Ala Glu Ala Ala Ser Val Pro Ala 
3055 3060 3065 

GTG CTC CTC ACC TCC TAC TAC GCG CTG ACC CGG CTG GCC CGG GCC CGC 29264 
Val Leu Leu Thr Ser Tyr Tyr Ala Leu Thr Arg Leu Ala Arg Ala Arg 
3070 3075 3080 3085 

ACC GGA CAG ACC GTC CTC GTC CAC GCC GCC GCC GGC GGT GTC GGC ATC 29312 
Thr Gly Gin Thr Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met 
3090 3095 3100 

GCG ACC CTC CAA CTC GCC CGC CAC CTC GGC CTG GAG GTG TAC GCC ACC 29360 
Ala Thr Leu Gin Leu Ala Arg His Leu Gly Leu Glu Val Tyr Ala Thr 
3105 3110 3115 

GCG AGC ACC GGC AAA TGG GAC GCC CTG CAG AAG CAC GGC ATC CCC GAC 29408 
Ala Ser Thr Gly Lys Trp Asp Ala Leu Gin Lys His Gly He Pro Asp 
3120 3125 3130 

GAC CGC ATC GCC GAC TCC CGC ACC CTG GAC TTC GCC GAG CGC TTC CTG 29456 
Asp Arg He Ala Asp Ser Arg Thr Leu Asp Phe Ala Glu Arg Phe Leu 
3135 3140 3145 

TCC CGG ACG GGC GGC CGG GGT GTC GAC ATC GTG CTG AAC TCC CTC GCC 29504 
Ser Arg Thr Gly Gly Arg Gly Val Asp He Val Leu Asn Ser Leu Ala 
3150 3155 3160 3165 

GGC GAG TTC GTC GAC GCC TCA CTG CGG CTG CTG CCG CGC GGC GGG CAC 29552 
Gly Glu Phe Val Asp Ala Ser Leu Arg Leu Leu Pro Arg Gly Gly His 
3170 3175 3180 

TTC CTG GAA CTC GGC AAG GCC GAC GTC CGC GAC CCC CGG CGG ATC GCC 29600 
Phe Leu Glu Leu Gly Lys Ala Asp Val Arg Asp Pro Arg Arg He Ala 
3185 3190 3195 

GCC GCC CAT CCG GGC ACC GAC TAC CGG GCG TTC GAC CTG GTG CAG GCC 29648 
Ala Ala His Pro Gly Thr Asp Tyr Arg Ala Phe Asp Leu Val Gin Ala 
3200 3205 3210 

GGT CCC GAC ACC GTC GGG GAG ATG CTC GGG GAA CTG CTG GAA CTG TTC 2969 6 

Gly Pro Asp Thr Val Gly Glu Met Leu Gly Glu Leu Leu Glu Leu Phe 
40 3215 3220 3225 

GCG GCC GGA GCG CTG CGC CCG CTG CCG CTC ACC GCC TAC GGC ATA CGC 29744 
Ala Ala Gly Ala Leu Arg Pro Leu Pro Leu Thr Ala Tyr Gly He Ara 
3230 3235 3240 3245 

45 GAC GCC CGC ACC GCC TTG CGC ACC CTC AGC CAG GCC CGG CAC ACC GGC 29792 

Asp Ala Arg Thr Ala Leu Arg Thr Leu Ser Gin Ala Arg His Thr Gly 
3250 3255 32 60 

AAG CTC GTG CTG ACG GTG CCT GCC GGA TTC GAC ACC CAC CGC ACG GTG 29840 
Lys Leu Val Leu Thr Val Pro Ala Gly Phe Asp Thr His Arg Thr Val 
3265 3270 3275 

CTC CTC ACC GGC GGC ACG GGC ACG CTC GGC CAG ACA CTC GCC CGC CAT 29888 
Leu Leu Thr Gly Gly Thr Gly Thr Leu Gly Gin Thr Leu Ala Arg His 
55 3280 3285 32 90 



TO 



75 



20 



25 



30 



35 
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CTG GTC AAC CGC CAC GGC GTA CGG CAC CTG CTG CTC GCC GGC CGC ACC 
Leu Val Asn Arg His Gly Val Arg His Leu Leu Leu Ala Gly Arg Thr 
3295 3300 3305 

GGC GCG GCG GCC GAG GGC GTC GCG GAA CTG ATT GGT GAA CTC GGC GAG 
Gly Ala Ala Ala Glu Gly Val Ala Glu Leu He Gly Glu Leu Gly Glu 
3310 3315 3320 3325 

TTG GGC GCC GAG GTC CGG GTC GCG GCC TGC GAC GCG GCC GAC CGG CAG 
Leu Gly Ala Glu Val Arg Val Ala Ala Cys Asp Ala Ala Asp Arg Gin 
33 30 3335 3340 

CGG CTC ACC GAA CTC CTC GCC GGA ATC CCC GTC GAG CAC CCG CTC GGC 
Arg Leu Thr Glu Leu Leu Ala Gly He Pro Val Glu His Pro Leu Gly 
3345 3350 3355 

GCC GTC GTC CAC GCC GCG GGC ACC CTC GAC GAC GGC ACC ATC CCC TCA 
Ala Val Val His Ala Ala Gly Thr Leu Asp Asp Gly Thr lie Pro Ser 
3360 3365 3370 

CTG ACC GGC GAG AAC ATC GAC AAC GTC CTG CGG CCC AAG GCC GAC GCC 
Leu Thr Gly Glu Asn He Asp Asn Val Leu Arg Pro Lys Ala Asp Ala 
3375 3380 3385 

GTG CTC AAC CTG CAC GAG CTG ACC CGC GAC GCC GAC CTC TCG GCG TTC 
Val Leu Asn Leu His Glu Leu Thr Arg Asp Ala Asp Leu Ser Ala Phe 
3390 3395 3400 3405 

GTC CTC TAC TCG TCC TCC TCG GCG CTG CTC GGC AGC CCC GGC CAG GGC 
Val Leu Tyr Ser Ser Ser Ser Ala Leu Leu Gly Ser Pro Gly Gin Gly 
3410 3415 3420 

GCC TAC GCC GCG GCC AAC GCC TTC CTG GAC GGC TTC GCC CGA TAC CGC 
Ala Tyr Ala Ala Ala Asn Ala Phe Leu Asp Gly Phe Ala Arg Tyr Arg 
3425 3430 3435 

AAG GGC CTC GGG CTG CCG GCG CTC TCG CTC GCC TCG GGA CTC TCG GGC 
Lys Gly Leu Gly Leu Pro Ala Leu Ser Leu Ala Trp Gly Leu Trp Gly 
3440 3445 3450 

AGC AAC AGC CGC ATC GCG GGC CAC CTC GAC CAG TCG GGC ATC CAA CGG 
Ser Asn Ser Arg Met Ala Gly His Leu Asp Gin Ser Gly Met Gin Arg 
3455 3460 3455 

CGC CTC AAC CGG AGC GGC ATC ATC GCG CTC ACC GAC GCC GAG GGC CTC 
Arg Leu Asn Arg Ser Gly He Met Ala Leu Thr Asp Ala Glu Gly Leu 
3470 3475 3480 3485 

GCC CTC TTC GAC GCC GCA CAG GAC GGC GGG GAC GCG CTC CTC GTG CCG 
Ala Leu Phe Asp Ala Ala Gin Asp Gly Gly Asp Ala Leu Leu Val Pro 
3490 3495 3500 

ATC CGG CTC AAC CGG ACG GCC CTT CGC GCC TCG GGA CGG ATC ACC CCG 
Met Arg Leu Asn Arg Thr Ala Leu Arg Ala Ser Gly Arg He Thr Pro 
3505 3510 3515 

TTC CTC AGC GGC TTG GCC GGC GGC GGG CCG GCG GCG GGG GAG AGG CGC 
Phe Leu Ser Gly Leu Ala Gly Gly Gly Pro Ala Ala Gly Glu Arg Arg 
3520 3525 3530 
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CCC GAG GTG GCA GCC GTA TCC GGG ACA CTC GCG GAA CGG CTG ACC GGG 
Pro Glu Val Ala Ala Val Ser Gly Thr Leu Ala Glu Arg Leu Thr Glv 
353 5 3540 3545 

CTC ACG GCA CAG GAA GGG CAC GCC CTC GTC CTG GCC GAG ATC CGC GCC 
Leu Thr Ala Gin Glu Gly His Ala Leu Val Leu Ala Glu He Arg Ala 
3550 3555 3560 



3565 



CAC GCG GCG GCG GTG CTG GGC CAC GGC TCC GAC GAC TCG ATC CCC GAG 
His Ala Ala Ala Val Leu Gly His Gly Ser Asp Asp Ser He Pro Glu 
3570 3 575 358O 

GAC CGG GCC TTC AAG GAC CTC GGC TTC GAC TCG CTC ACC GCC GTG GAG 
Asp Arg Ala Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu 
3585 3590 3595 

ATC CGC AAC CGG CTG AGC GCG GCC ACC GGC CTC CGG CTG CCC GCC ACC 
Met Arg ten Arg Leu Ser Ala Ala Thr Gly Leu Arg Leu Pro Ala Thr 
3600 3605 3610 

CTC GTC TTC GAC CAC CCG ACC CCG GGC GAG CTG GCC GGC CAC CTG AGT 
Leu Val Phe Asp His Pro Thr Pro Gly Glu Leu Ala Gly His Leu Ser 
3615 3 620 3625 

GOT GAA CTG TCC GCC GAC GAT GCC CCG GGC AGC GCC TCC CCG CTT ACC 
Ala Glu Leu Ser Ala Asp Asp Ala Pro Gly Ser Ala Ser Pro Leu Thr 
3630 3 «5 3640 3645 

GAA CTC GAC CGT TTC GAA GCC CTG TTC ACC GCT CTC GCA CCG GGG ACC 
Glu Leu Asp Arg Phe Glu Ala Leu Phe Thr Ala Leu Ala Pro Gly Thr 
3650 3 655 3 6 6o 

ACC AAG GAC ACC CCG GGC GGG GCC GGG GCA CTG ATG ATC GAC GAG GCC 
Thr Lys Asp Thr Pro Gly Gly Ala Gly Ala Leu Met He Asp Glu Sa 
3665 3670 3675 

GAG CGC CAA GAG ATC GCC GGG CGG CTC GCG GCG CTG GCC GGT CTG TGG 
Glu Arg Gin Glu lie Ala Gly Arg Leu Ala Ala Leu Ala Gly Leu Trp 
3680 3 685 3690 

AAC CGG CTG CAC GGC ACC ACG ACG GCT CCT GAG GAC GGC GAC ACC GTC 
Asn Arg Leu His Gly Thr Thr Thr Ala Pro Glu Asp Gly Asp J£ vll 

3700 3705 

GCG GAC GCC CTG GAA GCC GCG GAC GAC CAC GAG ATC TTC GCA TTC CTC 
Ala Asp Ala Leu Glu Ala Ala Asp Asp His Glu lie Phe Ala S 2S 
10 3715 5720 3725 

Z Su £ ™ GCCCGCCCCA G «*CAGCAC AGGTCAAAAC AC ATG GCC 

3730 J* Ala 

AAC GCG AAC GAG CAG CAA CTC CGT GCC TAT CTG AAG CGA GCG ACG ACC 
Asn Ala Asn Glu Gin Gin Leu Arg Ala Tyr Leu Lys Arg Ala Thr ihr 
5 10 15 

SI f T £ C GAA CAA ^ AGG GAG GAG CGG GCA CGG GCC 

Glu Leu His Arg Thr Ser Glu Gin Leu Arg Glu Glu Arg Ala Arg Ala 

25 30 
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CAC GAG CCG ATC GCC GTC CTC GGC ATG GCC TGC CGC TAC CCC GGA GGC 31381 

His Glu Pro He Ala Val Val Gly Met Ala Cys Arg Tyr Pro Gly Gly 
35 40 45 50 

GCG AAC ACC CCC GAA CAG TTC TGG GAA CTG CTC GAC ACC GGC ACC GAC 31429 
Ala Asn Thr Pro Glu Gin Phe Trp Glu Leu Leu Asp Thr Gly Thr Asp 
S5 60 65 

GCC GCC GCG CCG ATG CCC TCC GAC CGG GGA TGG GAC ACC CAC GGG CTG 31477 
Ala Ala Ala Pro Met Pro Ser Asp Arg Gly Trp Asp Thr His Gly Leu 
7 ° 75 80 

TAC GAC CCC GAC CCG GCG GCA GCG GGG CGC ACC TAC TGC CGG GAG GGC 
Tyr Asp Pro Asp Pro Ala Ala Ala Gly Arg Thr Tyr Cys Arg Glu Gly 
85 go 95 

GGC TTC CTC CAC GAC GCG GGC GAC TTC GAC GCG GAC TTC TTC GGG ATT 
Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Asp Phe Phe Gly He 
100 105 no 

TCG CCG CGT GAG GCG GTG GCG ATG GAT CCG CAG CAG CGG CTG TTG CTG 31621 
Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu 
115 120 125 130 

° AG ACC l CC GAG GCG ATC GAA GCG GCC GGT ATC GAC CCG CGA GGA 31669 

Glu Thr Ser Trp Glu Ala He Glu Ala Ala Gly lie Asp Pro Arg Gly 
135 140 145 

CTC CGC GGC AGC CGC ACC GGG GTG TAC GTG GGC GCC TGG GAC AGC GGC 
Leu Arg Gly Ser Arg Thr Gly Val Tyr Val Gly Ala Trp Asp Ser Gly 
150 155 160 

TAC ACC GGC CAG GCG CAC GCG CCC TCG GCC GAG TTG GAG GCC GAC CTG 
Tyr Thr Gly Gin Ala His Ala Pro Ser Ala Glu Leu Glu Ala Asp Leu 
165 170 175 



TCC AAG GCG TTC GCC GAC GCC GCC GAC GGC TTC GGC CCC GCC GAG GGT 
Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Phe Gly Pro Ala Glu Gly 
55 260 265 270 



31525 



31573 



31717 



31765 



31861 



as f TC ^° 000 GGC GTC GTC AGC ACC TC C GGC CGT ATC GCC TAC ACG 31813 

Leu Thr Gly Gly Val Val Ser Phe Thr Ser Gly Arg lie Ala Tyr J£ 
180 185 190 

CTG GGG CTG GAG GGT CCG GCC TTG ACC GTG GAC ACC GCG TGT TCG TCG 
Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser 
40 195 200 205 210 

TCG CTG GTC GCC CTG CAC AAC GCG GCG CAG GCG CTG CGG CGC GGC GAA 31909 
Ser Leu Val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Arg Gly Glu 
215 220 225 

TGC GAC CTG GCG TTG GCC GGT GGT GTG ACG GTG ATG GCG ACC CCG GCG 
Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Ala 
230 235 240 

GTG TTC GTC CAG TTC GCC CGG CAG CGA GGG CTG GCG CCG GAC GGC CGC 3? on* 

Val Phe Val Gin Phe Ala Arg Gin Arg Gly Leu Ala Pro £p SJ Sg 
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GTG GGG ATG GTG CTG GTG GAG CGG TTG TCG GAT GCC CGG CGG TTG GGG 32101 
Val Gly Met Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly 
275 280 285 290 

CAT CCG GTG TTG GCG GTG GTG TGT GGG TCG GCG GTG AAT CAG GAC GGT 32149 
His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly 
295 300 305 

GCG TCG AAT GGT TTG ACG GCG CCG AGT GGT CCG TCG CAG GAG CGG GTG 32197 
Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val 
310 315 320 

ATT CGT CAG GCG TTG GGG AAT GCG CGG TTG ACG GTG GCG GAT GTG GAT 32245 
lie Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp Val Asp 
325 330 335 

GTG GTG GAG GCG CAT GGG ACG GGG ACG CGG CTG GGT GAT CCG ATC GAG 32293 
Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro lie Glu 
340 345 350 

GCG CAG GCG TTG CTG GGG ACG TAT GGG CGG GAT CGT GAT GGT GGG CGT 32341 
Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly Gly Arg 
355 360 365 370 

CCG GTG TGG TTG GGG TCG TTG AAG TCG AAT ATT GGT CAT GCT CAG GCG 32389 
Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
375 380 385 

GCT GCG GGG GTG GCT GGT GTG ATC AAG ATG GTG TTG GCG ATG CGG TAT 32437 
Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Met Arg Tyr 
390 395 400 

GGG TGG TTG CCG CGG ACG TTG CAT GTG GAT GAG CCG AGC CGG CAT GTG 32485 
Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val 
405 410 415 

GAC TGG TCG GCT GGT GGT GTG CGG TTG CTG ACC GAG GCG CGG GAG TGG 32533 
Asp Trp Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg Glu Trp 
420 425 430 



CCG GGG GTG GAC CGG CCG CGT CGG GCG GCG GTC TCC GCC TTT GGT GTC 32581 
Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val 
40 435 440 445 450 

AGT GGT ACC AAC GCC CAT CTG ATC CTC GAA GCC CCC GAC ACC GCC GAG 32629 
Ser Gly Thr Asn Ala His Leu He Leu Glu Ala Pro Asp Thr Ala Glu 
455 460 465 



GCG GAG AGC GCC ACG ACC CCG GTC CGC TCT GAG GTG TCG GAG TCT GCT 32677 
Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu Ser Ala 
470 475 480 

GCG GTC CTC GAT GCC CGC AGT GGT GTG GTG CCG GTG GTG GTT TCG GGG 32725 
Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val Ser Gly 
485 490 495 

CGT TCG CGG GTG GTG GTG CGG GAG GCT GCG GGC CGG TTG GCG GAG GTG 32773 
Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala Glu Val 
500 505 510 
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GTG GAG GCC GGT GGT GTG GGG CTG GCG GAT GTG GCG GTG ACG ATG GCG 32821 
Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr Met Ala 
515 520 525 530 

GGC CGG TCG CGG TTT GGG TAT CGG GCG GTT GTG CTG GCT CGG GGT GAG 32869 
Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg Gly Glu 
535 540 545 

GCT GAG CTT GCC GGG CGT TTG CGG GCG TTG GCG GGG GGT GAT CCG GAC 32917 
Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp Pro Asp 
55 0 555 560 

GCG GGT GTG GTC ACG GGT GCG GTG GTG GAC CCG GAG ACG GGG TCC GGT 32965 
Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly Ser Gly 
565 570 575 

GGT GGG GGG GTG GTG TTG GTT TTC CCT GGT CAG GGG ACG CAG TGG GTG 33013 
Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin Trp Val 
580 585 590 

GGG ATG GGT GCG GGG CTG CTG GGG TCT TCG GAG GTG TTT GCG GCG TCG 33061 
Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala Ala Ser 
595 600 605 610 

ATG CGG GAG TGT GCG CGG GCG CTG AGT GTT CAT GTG GGG TGG GAT TTG 33109 
Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp Asp Leu 
615 620 625 

CTG GAG GTG GTG TCG GGC GGG GCC GGG TTG GAG CGG GTG GAT GTG GTC 33157 
Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp Val Val 
630 635 640 

CAG CCG GTG ACG TGG GCG GTG ATG GTG TCG CTG GCC CGG TAC TGG CAG 33205 
Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr Trp Gin 
645 650 655 

GCG ATG GGT GTG GAC GTG GCT GCG GTG GTG GGT CAT TCC CAG GGG GAG 33253 
Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin Gly Glu 
660 665 670 



ATC GCT GCT GCC ACG GTG GCG GGG GCG TTG TCG CTG GAG GAT GCG GCG 33301 
lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala 
40 6 7 5 6 8 o 685 *- 69Q 

GCT GTG GTC GCT CTG CGG GCG GGG TTG ATT GGC CGG TAT CTG GCG GGT 33349 
Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu Ala Gly 
695 700 705 



CGT GGT GCG ATG GCG GCT GTT CCG CTC CCT GCC GGC GAG GTC GAG GCC 33397 
Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val Glu Ala 
710 715 720 

GGG CTC GCG AAG TGG CCG GGT GTG GAG GTC GCG GCG GTC AAC GGT CCG 33445 
so Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn Gly Pro 

7 25 730 735 

GCG TCT ACG GTG GTT TCC GGG GAT CGG CGG GCG GTC GCC GGT TAT GTG 33493 
Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly JJr S3 
55 740 745 750 
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GCC GTC TGT CAG GCG GAG GGT GTG CAG GCT CGG TTG ATA CCG GTG GAC 33541 
Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu lie Pro Val Asp 
755 760 765 770 

TAC GCC TCT CAC TCC CGC CAT GTG GAG GAC CTG AAG GGC GAG TTG GAG 33589 
Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu Leu Glu 
775 780 785 

CGG GTG CTG TCC GGT ATC CGC CCC CGC AGT CCG CGG GTG CCG GTG TCT 33 637 

Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro Val Cvs 
79 ° 795 800 

TCC ACC GTC GCC GGA GAG CAG CCG GGC GAG CCG GTT TTC GAT GCG GGG 33 685 

Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp Ala Gly 
805 810 815 

TAT TGG TTC CGT AAT CTG CGG AAC CGG GTT GAG TTC TCC GCG GTG GTC 33733 
Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala Val Val 
820 825 830 

GGT GGT TTC TTG GAG GAG GGC CAC CGT CGG TTC ATC GAG GTC AGT GCC 
Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val Ser Ala 
835 840 8 45 850 

CAC CCG GTA CTC GTC CAT GCC ATT GAG CAG ACG GCC GAG GCC GCG GAC 33829 
His Pro Val Leu Val His Ala lie Glu Gin Thr Ala Glu Ala Ala Asp 
855 860 865 

CGG AGT GTC CAT GCC ACC GGG ACC CTC CGC CGC CAG GAC GAC AGC CCG 
Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro 
870 875 880 

CAC CGC CTG CTC ACC TCC ACC GCC GAG GCC TGG GCC CAC GGC GCC ACC 
His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly Ala Thr 
88 5 890 895 



ACC TAC CCC TTC AAC CAC CAC CAC TAC TGG CTC GAC ACC ACC CCC ACC 
Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr Thr Pro Thr 
40 915 920 925 930 

ACC CCC GCG ACG ACC ACC CAG AGC CCC ACC GAT GCC TGG CGC TAC CGC 
Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg Tyr Arg 
935 940 945 



55 



CTC GGG GAC CGG GCC GCC GGC GCC CTC GCA CGT CAG GGG GCC ACG GTC 
Leu Gly Asp Arg Ala Ala Gly Ala Leu Ala Arg Gin Gly Ala Thr Val 
980 985 990 



33781 



33877 



33925 



35 r ™ C GAC CCC GCC CTG CCC CCA CAC CTC ACC ACC CTC CCC 33973 

Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr Leu Pro 
900 905 9 10 



34021 



34069 



34117 



GTC ACC TGG AAA GCC CTC ACC GAA GAA TCC ACT CCG GCC TCG TCC CCC 
Val Thr Trp Lys Ala Leu Thr Glu Glu Ser Thr Pro Ala Ser Ser Pro 
950 955 960 

TCC GGT CAC TCG CTC CTC GTC ACA CCC CCG ACC CCC GAA GGC CGC ACG 34165 
so ser Gly His Trp Leu Leu Val Thr Pro Pro Thr Pro Glu Gly Arg Thr 

9 65 970 975 
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GAA CGG CTG GTG GTC GAT CCG GTC GCC GTC GGA CGC GAC GGG CTC GCG 
Glu Arg Leu Val Val Asp Pro Val Ala Val Gly Arg Asp Gly Leu Ala 
995 1000 1005 1010 

GCG CGC CTG GGC GAA CGG TGG GAC GGT GTG CTG TCC CTG CTC GGC GCC 
Ala Arg Leu Gly Glu Arg Trp Asp Gly Val Leu Ser Leu Leu Gly Ala 
1015 1020 1025 

GAC GAG CGT CCG CTC CCA CGG CAT CCC GCC CTC AAC CGC GCC GTC ATG 
Asp Glu Arg Pro Leu Pro Arg His Pro Ala Leu Asn Arg Ala Val Met 
1030 1035 1040 

GGC ACC ACG CTG CTC GCC CAG GCC GCT CTG GAC GCA GGA TGC GAG GCG 
Gly Thr Thr Leu Leu Ala Gin Ala Ala Leu Asp Ala Gly Cys Glu Ala 
1045 1050 1055 

CGG ATA TGG GCC GTG ACG CGG GAG GCC GTC GCC GTC TCC CCG AGC GAG 
Arg He Trp Ala Val Thr Arg Glu Ala Val Ala Val Ser Pro Ser Glu 
1060 1065 1070 

GTG CCG CGG GAC GCC GGC GCG CAG CTC TGG GGG CTC GGG CGG GGC ATC 
Val Pro Arg Asp Ala Gly Ala Gin Leu Trp Gly Leu Gly Arg Gly He 
1075 1080 1085 1090 

GCG CTG GAA CAC CCC TCC CTC TGG GGC GGA TO ATC GAT CTG CCC GCC 
Ala Leu Glu His Pro Ser Leu Trp Gly Gly Leu He Asp Leu Pro Ala 
1095 HOO H05 

GTG CCG GAC GAA CGC GCG TGG GCC AGG GCC GTC CGG CGG CTC GTC CCG 
Val Pro Asp Glu Arg Ala Trp Ala Arg Ala Val Arg Arg Leu Val Pro 
111° H15 1120 

CAC GGT GAG GAC CAG ATC GCC GCG CGC GCC TCG GGT GCC TAT GGG CGC 
His Gly Glu Asp Gin He Ala Ala Arg Ala Ser Gly Ala Tyr Gly Arg 
1125 mo H35 

AGG CTC CTG CCG GCT CCG CCG GCC GCG TCG CGC CGC ACC TGC ACA CCG 
Arg Leu Leu Pro Ala Pro Pro Ala Ala Ser Arg Arg Thr Cys Thr Pro 
1140 H45 U50 

TCC GGC ACG GTG CTG GTC ACC GGC GGT ACG GGA GCG CTC GGC GGT CAT 
Ser Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly His 
1155 U60 U65 U70 

CTG GCC CGC CGT CTC GCA CGC GGC GGG ACC GGG CAT CTG GTG CTC ACC 
Leu Ala Arg Arg Leu Ala Arg Gly Gly Thr Gly His Leu Val Leu Thr 
H ?5 1180 H85 

AGC CGT CGC GGC CCG GAC GCG CCG GGC GCC GGT GAA CTC GCC GGT GAA 
Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Gly Glu Leu Ala Gly Glu 
H 9 0 1195 1200 

CTC GCC TCC CTG GGC GCG AAG GTC ACG GTC GCC GCG TGC GAC ATG GCC 
Leu Ala Ser Leu Gly Ala Lys Val Thr Val Ala Ala Cys Asp Met Ala 
1205 1210 1215 

GAC CGT GAA GCC GTG CGG GCG CTG CTC GAC GAG CAC CGG CCG ACC GCG 
Asp Arg Glu Ala Val Arg Ala Leu Leu Asp Glu His Arg Pro Thr Ala 
1220 1225 1230 
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GTG TTC CAC ACG GCG GGC ACG CCC CAC TCG GCG GAG TTC ACG GCG CTG 
Val Phe His Thr Ala Gly Thr Pro His Ser Ala Glu Phe Thr Ala Leu 
1235 1240 1245 1250 

GAC GAG ACG ACG ACG GCC GGG GTG TAC GGC GGG AAG GTC CTG GGT GCC 
Asp Glu Thr Thr Thr Ala Gly Val Tyr Gly Gly Lys Val Leu Gly Ala 
1255 1260 1265 

CGG CAT CTG GAC GAA CTG ACC CGG GAA CTC GGC ATC GGG CTG GAC GCG 
Arg His Leu Asp Glu Leu Thr Arg Glu Leu Gly He Gly Leu Asp Ala 
1270 1275 1280 

TTC GTC CTC TTC TCC TCC GGC GCC GCG GTC TGG GGC AGC GGC GGC CAG 
Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Gly Gly Gin 
1285 1290 1295 

ACC GCT TAC GGG GCC GCG AAC GCC GCG CTG GAC GCC CTC GCC GAG CGG 
Thr Ala Tyr Gly Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala Glu Arg 
1300 1305 1310 

CGC CGT GCC GCC GGG CTG CCC GCG ACC TCC GTC GCC TGG GGC CTG TGG 
Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly Leu Trp 

1330 



1315 1320 1325 



GGC GGC GGA GGC ATG GGG GAG GGG GAC GGT GAG GAG TTC CTC AGC CGG 
Gly Gly Gly Gly Met Gly Glu Gly Asp Gly Glu Glu Phe Leu Ser Arg 
1335 1340 1345 

CGC GGC CTC GGC GTG ATG CCG CCG GAG GAC GCG CTG GAA GCC CTG GAC 
Arg Gly Leu Gly Val Met Pro Pro Glu Asp Ala Leu Glu Ala Leu Asp 
1350 1355 1360 

CGG GCC CTG GAC CGG GAG GAC ACC ACC GTC GTG GTG GCG GAT GTC GAC 
Arg Ala Leu Asp Arg Glu Asp Thr Thr Val Val Val Ala Asp Val Asp 
1365 1370 1375 

TGG GAG CGG TTC GCC CCG GCC TTC ACC GCG TTC CGG CCC AGT GCG CTG 
Trp Glu Arg Phe Ala Pro Ala Phe Thr Ala Phe Arg Pro Ser Ala Leu 
1380 1385 1390 

ATC TCC CGG CTG GTC TCG GAC GGC GGG GAG GCG GGG GGG CAG GAC GCC 
He Ser Arg Leu Val Ser Asp Gly Gly Glu Ala Gly Gly Gin Asp Ala 
I 395 1400 1405 1410 

CCG GAC GGC ACG CTG TTC GCC GCC GGG TTC GCG GCC GCC GGG CCA CTG 
Pro Asp Gly Thr Leu Phe Ala Ala Gly Phe Ala Ala Ala Gly Pro Leu 
1415 1420 1425 

GAG CGG CAG GAG ATG CTG CTC GGC CTG GTG CGC CGG CAT GTG GCC GCC 
Glu Arg Gin Glu Met Leu Leu Gly Leu Val Arg Arg His Val Ala Ala 
1430 1435 1440 

GTA CTC GGC CAC CCG GGG ACC GCG GAC ATC GGT CCC GAC CGT GCT TTC 
Val Leu Gly His Pro Gly Thr Ala Asp He Gly Pro Asp Arg Ala Phe 
1445 1450 1455 

AAG GAG CTG GGG TTC AGT TCG GTC ACC GCC GTC GAG CTG GCC GGG CGG 
Lys Glu Leu Gly Phe Ser Ser Val Thr Ala Val Glu Leu Ala Gly Arg 
1460 1465 1470 
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CTC GGC CGG GAG TGC GGA CGG AAG CTG CCG CCG ACG CTG GTC TTC GAC 
Leu Gly Arg Glu Cys Gly Arg Lys Leu Pro Pro Thr Leu Val Phe Asp 
1475 1480 1485 1490 

CAT CCG ACT GCC GCG GCG GCC GTC GAA CAC CTG GCG GAG CTG CTG ACA 
His Pro Thr Ala Ala Ala Ala Val Glu His Leu Ala Glu Leu Leu Thr 
1495 1500 1505 

CCG CCC GCC GGT CCC GCC GCC GGT CCC CGG GAG GAG GAG GCG CGG GCC 
Pro Pro Ala Gly Pro Ala Ala Gly Pro Arg Glu Glu Glu Ala Arg Ala 
1510 1515 1520 

GCC CTG GCG CGC GTG CCG CTC GAA CGG CTG AGG GAA GCC GGC CTG CTG 
Ala Leu Ala Arg Val Pro Leu Glu. Arg Leu Arg Glu Ala Gly Leu Leu 
1525 1530 1535 

GAC GCA CTG CTG CGG CTC GCC GCG GAC GAA TCC GGG GCG ACA ACC CCC 
Asp Ala Leu Leu Arg Leu Ala Ala Asp Glu Ser Gly Ala Thr Thr Pro 
1540 1545 1550 

CGT ACG TCT GCC GCG TCC GGC GCA CCC CGC GGC CGG GAG GAG CCG GAC 
Arg Thr Ser Ala Ala Ser Gly Ala Pro Arg Gly Arg Glu Glu Pro Asp 
1555 1560 1565 1570 

GGC CGC GGC GAG CCG GAC GGC TCG GGA CAC CGC GAA AGC CCG GAC GCG 
Gly Arg Gly Glu Pro Asp Gly Ser Gly His Arg Glu Ser Pro Asp Ala 
1575 1580 1585 

GCC GGC GGG TCG GAC GCC CTG GAC GAT CTC GAC GGG GAC GCC CTC GTC 
Ala Gly Gly Ser Asp Ala Leu Asp Asp Leu Asp Gly Asp Ala Leu Val 
1590 1595 16Q0 

CGG CTC GCC CTC GGG GAA CCG GGC GAG TGA CCGGCCGGCG GAGCACACCC 
Arg Leu Ala Leu Gly Glu Pro Gly Glu * 
1605 i6io 

GGCCGTCTCC GGCCCGGCCG CGGCCGGGCC GGAAGCCATC CGCCGCCCAC CCGGTACCGA 

CCCCTCAAGC CCTTCAAGCC CTTCGACCCG TCCGATCAGT CAGTC CGGCG GTCCTCCACG 

ACCGGTCCGG AATCGCCCCC ACACGAGTCA GGAAGCACAC C ATG GCC ATG TCC 

Met Ala Met Ser 
1 

GCC GAG AGG CTC ACG GAG GCG CTG CGG ACC TCG CTC AAG GAG GCC GAG 
Ala Glu Arg Leu Thr Glu Ala Leu Arg Thr Ser Leu Lys Glu Ala Glu 
5 10 15 20 

CGG CTC CGG CGG CAG AAC CGC GAA CTC AGG GCC GCG CGG GAC GCG GCG 
Arg Leu Arg Arg Gin Asn Arg Glu Leu Arg Ala Ala Arg Asp Ala Ala 
25 30 35 

CGG GAG CCG ATC GCC GTC GTC GGC ATG GCC TGC CGC TAC CCG GGC GGT 
Arg Glu Pro He Ala Val Val Gly Met Ala Cys Arg Tyr Pro Gly Gly 
40 45 50 

GTC ACC GGC CCC GAG GAG CTC TCG GAG CTC GTG GCC GGA GGC CGG GAC 
Val Thr Gly Pro Glu Glu Leu Trp Glu Leu Val Ala Gly Gly Arg Asp 
55 60 6 5 
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GCG ATC GGG CCG TTC CCC GTG GAC CGG GGC TGG GAC GTG GCG TCG GTG 
Ala lie Gly Pro Phe Pro Val Asp Arg Gly Trp Asp Val Ala Ser Val 
70 75 80 



36500 



JO 



15 



20 



TAG GAC CCG GAT CCC GAG TCG AAG GGC ACC ACG TAC TGC CGG GAG GGC 36548 
Tyr Asp Pro Asp Pro Glu Ser Lys Gly Thr Thr Tyr Cys Arg Glu Gly 
85 90 95 100 

GGG TTC CTG GAA GGC GCC GGT GAC TTC GAC GCC GCC TTC TTC GGC ATC 36596 
Gly Phe Leu Glu Gly Ala Gly Asp Phe Asp Ala Ala Phe Phe Gly He 
105 HO us 

TCG CCG CGC GAG GCC CTG GTG ATG GAC CCG CAG CAG CGG CTG CTG CTG 36644 
Ser Pro Arg Glu Ala Leu Val Met Asp Pro Gin Gin Arg Leu Leu Leu 
120 125 130 

GAG GTG TCC TGG GAG GCG CTG GAA CGC GCG GGC ATC GAC CCG TCC TCG 36692 
Glu Val Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro Ser Ser 
135 140 145 

CTG CGC GGC AGC CGC GGT GGT GTC TAC GTG GGC GCC GCG CAC GGC TCG 36740 
Leu Arg Gly Ser Arg Gly Gly Val Tyr Val Gly Ala Ala His Gly Ser 
150 155 160 

TAC GCC TCC GAT CCC CGG CTG GTG CCC GAG GGC TCG GAG GGC TAT CTG 36788 
Tyr Ala Ser Asp Pro Arg Leu Val Pro Glu Gly Ser Glu Gly Tyr Leu 
25 165 170 175 180 

CTG ACC GGC AGC GCC GAC GCG GTG ATG TCC GGC CGC ATC TCC TAC GCG 4683 6 

Leu Thr Gly Ser Ala Asp Ala Val Met Ser Gly Arg He Ser Tyr Ala 
185 190 195 

CTC GGT CTC GAA GGA CCG TCC ATG ACG GTG GAG ACG GCC TGC TCC TCC 36884 
Leu Gly Leu Glu Gly Pro Ser Met Thr Val Glu Thr Ala Cys Ser Ser 
200 205 210 

TCG CTG GTG GCG CTG CAT CTG GCG GTA CGG GCG CTG CGG CAC GGC GAG 36932 
Ser Leu Val Ala Leu His Leu Ala Val Arg Ala Leu Arg His Gly Glu 
215 220 225 

TGC GGG CTC GCG CTG GCG GGC GGG GTG GCG GTG ATG GCC CAT CCG GCG 36980 
Cys Gly Leu Ala Leu Ala Gly Gly Val Ala Val Met Ala Asp Pro Ala 
230 235 240 

GCG TTC GTG GAG TTC TCC CGG CAG AAG GGG CTC GCC GCC GAC GGC CGC 37028 
Ala Phe Val Glu Phe Ser Arg Gin Lys Gly Leu Ala Ala Asp Gly Arg 
245 250 2 55 Y 260 

TGC AAG GCG TTC TCG GCC GCC GCC GAC GGC ACC GGC TGG GCC GAG GGC 37076 
Cys Lys Ala Phe Ser Ala Ala Ala Asp Gly Thr Gly Trp Ala Glu Gly 
265 270 275 

GTC GGC GTG CTC GTC CTG GAG CGG CTG TCG GAC GCG CGC CGC GCG GGG 37124 
Val Gly Val Leu Val Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly 
280 285 290 

CAC ACG GTC CTC GGC CTG GTC ACC GGC ACC GCG GTC AAC CAG GAC GGT 37172 
His Thr Val Leu Gly Leu Val Thr Gly Thr Ala Val Asn Gin Asp Gly 
55 295 300 305 
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CTC GCC GCC GCC CTG GAC GCG TTG GCC GAA GGA CGC GAC GGT CCG GCG 37940 
Leu Ala Ala Ala Leu Asp Ala Leu Ala Glu Gly Arg Asp Gly Pro Ala 
550 555 560 

GTG GTG CGC GGA GTC CGC GAC CGG GAC GGC AGG ATG GCC TTC CTC TTC 37988 
Val Val Arg Gly Val Arg Asp Arg Asp Gly Arg Met Ala Phe Leu Phe 
565 570 575 580 

ACC GGG CAG GGC AGC CAG CGC GCC GGG ATG GCC CAC GAC CTG CAT GCC 38036 
Thr Gly Gin Gly Ser Gin Arg Ala Gly Met Ala His Asp Leu His Ala 
585 590 595 

GCC CAT ACC TTC TTC GCG TCC GCC CTC GAC GAG GTG ACG GAC CGT CTC 38084 
Ala His Thr Phe Phe Ala Ser Ala Leu Asp Glu Val Thr Asp Arg Leu 
600 605 610 

GAC CCG CTG CTC GGC CGG CCG CTC GGC GCG CTG CTG GAC GCC CGA CCC 38132 
Asp Pro Leu Leu Gly Arg Pro Leu Gly Ala Leu Leu Asp Ala Arg Pro 
615 620 625 

20 GGC TCG CCC GAA GCG GCA CTC CTG GAC CGG ACC GAG TAC ACC CAG CCG 38180 

Gly Ser Pro Glu Ala Ala Leu Leu Asp Arg Thr Glu Tyr Thr Gin Pro 
630 635 640 
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GCG CTC TTC GCC GTC GAG GTG GCG CTC CAC CGG CTG CTG GAG CAC TGG 38228 
Ala Leu Phe Ala Val Glu Val Ala Leu His Arg Leu Leu Glu His Trp 
645 650 655 660 

GGG ATG CGC CCC GAC CTG CTG CTG GGG CAC TCG GTG GGC GAA CTG GCG 38276 
Gly Met Arg Pro Asp Leu Leu Leu Gly His Ser Val Gly Glu Leu Ala 
665 670 675 

GCC GCC CAC GTC GCG GGT GTG CTC GAT CTC GAC GAC GCC TGC GCG CTG 38324 
Ala Ala His Val Ala Gly Val Leu Asp Leu Asp Asp Ala Cys Ala Leu 
680 685 690 

GTG GCC GCC CGC GGC AGG CTG ATG CAG CGC CTG CCG CCC GGC GGC GCG 38372 
Val Ala Ala Arg Gly Arg Leu Met Gin Arg Leu Pro Pro Gly Gly Ala 
695 700 705 



ATG GTC TCC GTG CGG GCC GGC GAG GAC GAG GTC CGC GCA CTG CTG GCC 38420 
Met Val Ser Val Arg Ala Gly Glu Asp Glu Val Arg Ala Leu Leu Ala 
40 710 715 720 

GGC CGC GAG GAC GCC GTC TGC GTC GCC GCG GTG AAC GGC CCC CGG TCG 38468 

Gly Arg Glu Asp Ala Val Cys Val Ala Ala Val Asn Gly Pro Arg Ser 
725 730 735 740 

45 

GTG GTG ATC TCC GGC GCG GAG GAA GCG GTG GCC GAG GCG GCG GCG CAG 38516 

Val Val He Ser Gly Ala Glu Glu Ala Val Ala Glu Ala Ala Ala Gin 
745 750 755 

CTC GCC GGA CGA GGC CGC CGC ACC AGG CGG CTC CGC GTC GCG CAC GCC 38564 
Leu Ala Gly Arg Gly Arg Arg Thr Arg Arg Leu Arg Val Ala His Ala 
760 765 770 

TTC CAC TCA CCC CTG ATG GAC GGC ATG CTC GCC GGA TTC CGG GAG GTC 38612 
Phe His Ser Pro Leu Met Asp Gly Met Leu Ala Gly Phe Arg Glu Val 
55 775 780 785 
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GCC GCC GGC CTG CGC TAC CGG GAA CCG GAG CTG ACG GTC GTC TCC ACG 38660 
Ala Ala Gly Leu Arg Tyr Arg Glu Pro Glu Leu Thr Val Val Ser Thr 
790 795 800 

GTC ACG GGG CGG CCC GCC CGC CCC GGT GAA CTC ACC GGC CCC GAC TAC 38708 
Val Thr Gly Arg Pro Ala Arg Pro Gly Glu Leu Thr Gly Pro Asp Tyr 
805 810 815 820 

TGG GTG GCC CAG GTC CGT GAG CCC GTG CGC TTC GCG GAC GCG GTC CGC 38756 
Trp Val Ala Gin Val Arg Glu Pro Val Arg Phe Ala Asp Ala Val Arg 
825 830 835 

ACG GCA CAC CGC CTC GGA GCC CGC ACC TTC CTG GAG ACC GGC CCG GAC 38804 
Thr Ala His Arg Leu Gly Ala Arg Thr Phe Leu Glu Thr Gly Pro Asp 
840 845 850 

GGC GTG CTG TGC GGC ATG GCA GAG GAG TGC CTG GAG GAC GAC ACC GTG 38852 
Gly Val Leu Cys Gly Met Ala Glu Glu Cys Leu Glu Asp Asp Thr Val 
855 860 865 

GCC CTG CTG CCG GCG ATC CAC AAG CCC GGC ACC GCG CCG CAC GGT CCG 38900 
Ala Leu Leu Pro Ala He His Lys Pro Gly Thr Ala Pro His Gly Pro 
870 875 880 



GCG GCT CCC GGC GCG CTG CGG GCG GCC GCC GCC GCG TAC GGC CGG GGC 38948 
Ala Ala Pro Gly Ala Leu Arg Ala Ala Ala Ala Ala Tyr Gly Arg Gly 
25 885 890 895 900 

GCC CGG GTG GAC TGG GCC GGG ATG CAC GCC GAC GGC CCC GAG GGG CCG 38996 
Ala Arg Val Asp Trp Ala Gly Met His Ala Asp Gly Pro Glu Gly Pro 
905 910 915 



GCC CGC CGC GTC GAA CTG CCC GTC CAC GCC TTC CGG CAC CGC CGC TAC 39044 
Ala Arg Arg Val Glu Leu Pro Val His Ala Phe Arg His Arg Arg Tyr 
920 925 930 

TGG CTC GCC CCG GGC CGC GCG GCG GAC ACC GAC GAC TGG ATG TAC CGG 39092 
Trp Leu Ala Pro Gly Arg Ala Ala Asp Thr Asp Asp Trp Met Tyr Arg 
935 940 945 

ATC GGC TGG GAC CGG CTG CCG GCT GTG ACC GGC GGG GCC CGG ACC GCC 39140 
He Gly Trp Asp Arg Leu Pro Ala Val Thr Gly Gly Ala Arg Thr Ala 
40 9 5 0 9 5 5 9 6 0 

GGC CGC TGG CTG GTG ATC CAC CCC GAC AGC CCG CGC TGC CGG GAG CTG 39188 
Gly Arg Trp Leu Val He His Pro Asp Ser Pro Arg Cys Arg Glu Leu 
965 970 975 980 



TCC GGC CAC GCC GAA CGC GCG CTG CGC GCC GCG GGC GCG AGC CCC GTA 39236 
Ser Gly His Ala Glu Arg Ala Leu Arg Ala Ala Gly Ala Ser Pro Val 
985 990 995 

CCG CTG CCC GTG GAC GCT CCG GCC GCC GAC CGG GCG TCC TTC GCG GCA 39284 
Pro Leu Pro Val Asp Ala Pro Ala Ala Asp Arg Ala Ser Phe Ala Ala 
1000 1005 1010 

CTG CTG CGC TCC GCC ACC GGA CCT GAC ACA CGA GGT GAC ACA GCC GCG 39332 
Leu Leu Arg Ser Ala Thr Gly Pro Asp Thr Arg Gly Asp Thr Ala Ala 
55 1015 1020 1025 
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CCC GTG GCC GGT GTG CTC TCG CTG CTG TCC GAG GAG GAT CGG CCC CAT 
Pro Val Ala Gly Val Leu Ser Leu Leu Ser Glu Glu Asp Arg Pro His 
1030 1035 1040 

CGC CAG CAC GCC CCG GTA CCC GCC GGG GTC CTG GCG ACG CTG TCC CTG 
Arg Gin His Ala Pro Val Pro Ala Gly Val Leu Ala Thr Leu Ser Leu 
1045 1050 1055 1060 

ATG CAG GCT ATG GAG GAG GAG GCG GTG GAG GCT CGC GTG TGG TCC GTC 
Met Gin Ala Met Glu Glu Glu Ala Val Glu Ala Arg Val Trp Cys Val 
1065 1070 1075 

TCC CGC GCC GCG GTC GCC GCC GCC GAC CGG GAA CGG CCC GTC GGC GCG 
Ser Arg Ala Ala Val Ala Ala Ala Asp Arg Glu Arg Pro Val Gly Ala 
1080 1085 1090 

GGC GCC GCC CTG TGG GGG CTG GGG CGG GTG GCC GCC CTG GAA CGC CCC 
Gly Ala Ala Leu Trp Gly Leu Gly Arg Val Ala Ala Leu Glu Arg Pro 
1095 1100 1105 

ACC CGG TGG GGC GGT CTC GTG GAC CTG CCC GCC TCG CCC GGT GCG GCG 
Thr Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Ser Pro Gly Ala Ala 
1110 1115 H20 

CAC TGG GCG GCC GCC GTG GAA CGG CTC GCC GGT CCC GAG GAC CAG ATC 
His Trp Ala Ala Ala Val Glu Arg Leu Ala Gly Pro Glu Asp Gin He 
1125 U30 1135 1140 

GCC GTG CGC GCG TCC GGC AGT TGG GGC CGG CGC CTC ACC AGG CTG CCG 
Ala Val Arg Ala Ser Gly Ser Trp Gly Arg Arg Leu Thr Arg Leu Pro 
1145 H50 H55 

CGC GAC GGC GGC GGC CGG ACG GCC GCA CCC GCG TAC CGG CCG CGC GGC 
Arg Asp Gly Gly Gly Arg Thr Ala Ala Pro Ala Tyr Arg Pro Arg Gly 
1160 H65 H70 

ACG GTG CTC GTC ACC GGT GGC ACC GGC GCG CTC GGC GGG CAT CTC GCC 
Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly Gly His Leu Ala 
1175 U80 1185 

CGC TGG CTC GCC GCG GCG GGC GCC GAA CAC CTG GCG CTC ACC AGC CGC 
Arg Trp Leu Ala Ala Ala Gly Ala Glu His Leu Ala Leu Thr Ser Arg 
11*0 H95 1200 

CGG GGC CCG GAC GCG CCC GGC GCC GCC GGA CTC GAG GCC GAA CTC CTC 
Arg Gly Pro Asp Ala Pro Gly Ala Ala Gly Leu Glu Ala Glu Leu Leu 
1205 1210 1215 1220 

CTC CTG GGC GCC AAG GTG ACG TTC GCC GCC TCC GAC ACC GCC GAC CGC 
Leu Leu Gly Ala Lys Val Thr Phe Ala Ala Cys Asp Thr Ala Asp Arg 
1225 1230 1235 

GAC GGC CTC GCC CGG GTC CTG CGG GCG ATA CCG GAG GAC ACC CCG CTC 
Asp Gly Leu Ala Arg Val Leu Arg Ala He Pro Glu Asp Thr Pro Leu 
1240 1245 1250 

ACC GCG GTG TTC CAC GCC GCG GGC GTA CCG CAG GTC ACG CCG CTC TCC 
Thr Ala Val Phe His Ala Ala Gly Val Pro Gin Val Thr Pro Leu Ser 
12SS 1260 1265 
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CGT ACC TCG CCC GAG CAC TTC GCC GAC GTG TAC GCG GGC AAG GCG GCG 
Arg Thr Ser Pro Glu His Phe Ala Asp Val Tyr Ala Gly Lys Ala Ala 
1270 1275 1280 

GGC GCC GCG CAC CTG GAC GAA CTG ACC CGC GAA CTC GGC GCC GGA CTC 
Gly Ala Ala His Leu Asp Glu Leu Thr Arg Glu Leu Gly Ala Gly Leu 
12 *5 1290 1295 1300 

GAC GCG TTC GTC CTC TAC TCC TCC GGC GCC GGC GTC TGG GGC AGC GCC 
Asp Ala Phe Val Leu Tyr Ser Ser Gly Ala Gly Val Trp Gly Ser Ala 
1305 1310 1315 

GGC CAG GGT GCC TAC GCC GCC GCC AAC GCC GCC CTG GAC GCG CTC GCC 
Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala 
1320 1325 1330 

CGG CGC CGT GCG GCG GAC GGA CTC CCC GCC ACC TCC ATC GCC TGG GGC 
Arg Arg Arg Ala Ala Asp Gly Leu Pro Ala Thr Ser lie Ala Trp Gly 
1335 1340 1345 

GTG TGG GGC GGC GGC GGT ATG GGG GCC GAC GAG GCG GGC GCG GAG TAT 
Val Trp Gly Gly Gly Gly Met Gly Ala Asp Glu Ala Gly Ala Glu Tyr 
1350 1355 1360 

CTG GGC CGG CGC GGT ATG CGC CCC ATG GCA CCG GTC TCC GCG CTC CGG 
Leu Gly Arg Arg Gly Met Arg Pro Met Ala Pro Val Ser Ala Leu Arg 
1365 1370 1375 1380 

GCG ATG GCC ACC GCC ATC GCC TCC GGG GAA CCC TGC CCC ACC GTC ACC 
Ala Met Ala Thr Ala He Ala Ser Gly Glu Pro Cys Pro Thr Val Thr 
1385 1390 1395 

CAC ACC GAC TGG GAG CGC TTC GGC GAG GGC TTC ACC GCC TTC CGG CCC 
His Thr Asp Trp Glu Arg Phe Gly Glu Gly Phe Thr Ala Phe Arg Pro 
1400 1405 1410 

AGC CCT CTG ATC GCG GGG CTC GGC ACG CCG GGC GGC GGC CGG GCG GCG 
Ser Pro Leu He Ala Gly Leu Gly Thr Pro Gly Gly Gly Arg Ala Ala 
1415 1420 1425 

GAG ACC CCC GAG GAG GGG AAC GCC ACC GCT CCG GCG GAC CTC ACC GCC 
Glu Thr Pro Glu Glu Gly Asn Ala Thr Ala Ala Ala Asp Leu Thr Ala 
1430 1435 1440 

CTG CCG CCC GCC GAA CTC CGC ACC GCG CTG CGC GAG CTG GTG CGA GCC 
Leu Pro Pro Ala Glu Leu Arg Thr Ala Leu Arg Glu Leu Val Arg Ala 
1445 1450 1455 14 60 

CGG ACC GCC GCG GCG CTC GGC CTC GAC GA^ CCG GCC GAG GTC GCC GAG 
Arg Thr Ala Ala Ala Leu Gly Leu Asp Asp Pro Ala Glu Val Ala Glu 
1465 1470 1475 

GGC GAA CGG TTC CCC GCC ATG GGC TTC GAC TCC CTG GCC ACC GTA CGG 
Gly Glu Arg Phe Pro Ala Met Gly Phe Asp Ser Leu Ala Thr Val Arg 
14a0 H85 1490 

CTG CGC CGC GGA CTC GCC TCG GCC ACG GGC CTC GAC CTG CCC CCC GAT 
Leu Arg Arg Gly Leu Ala Ser Ala Thr Gly Leu Asp Leu Pro Pro Asp 
1495 1500 isos 



40100 



40148 



40196 



40244 



40292 



40340 



40388 



40436 



40484 



40532 



40580 



40628 



40676 



40724 



40772 
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CTG CTC TTC GAC CGG GAC ACC CCG GCC GCG CTC GCC GCC CAC CTG GCC 
Leu Leu Phe Asp Arg Asp Thr Pro Ala Ala Leu Ala Ala His Leu Ala 
1510 1515 1520 

GAA CTG CTC GCC ACC GCA CGG GAC CAC GGA CCC GGC GGC CCC GGG ACC 
Glu Leu Leu Ala Thr Ala Arg Asp His Gly Pro Gly Gly Pro Gly Thr 
1525 1530 1535 1540 

GGT GCC GCG CCG GCC GAT GCC GGA AGC GGC CTG CCG GCC CTC TAC CGG 
Gly Ala Ala Pro Ala Asp Ala Gly Ser Gly Leu Pro Ala Leu Tyr Arg 
1545 1550 1555 

GAG GCC GTC CGC ACC GGC CGG GCC GCG GAA ATG GCC GAA CTG CTC GCC 
Glu Ala Val Arg Thr Gly Arg Ala Ala Glu Met Ala Glu Leu Leu Ala 
1560 1565 1570 

GCC GCT TCC CGG TTC CGC CCC GCC TTC GGG ACG GCG GAC CGG CAG CCG 
Ala Ala Ser Arg Phe Arg Pro Ala Phe Gly Thr Ala Asp Arg Gin Pro 
1575 1580 1585 

GTG GCC CTC GTG CCG CTG GCC GAC GGC GCG GAG GAC ACC GGG CTC CCG 
Val Ala Leu Val Pro Leu Ala Asp Gly Ala Glu Asp Thr Gly Leu Pro 
1590 1595 1600 

CTG CTC GTG GGC TGC GCC GGG ACG GCG GTG GCC TCC GGC CCG GTG GAG 
Leu Leu Val Gly Cys Ala Gly Thr Ala Val Ala Ser Gly Pro Val Glu 
1605 1610 1615 1620 

TTC ACC GCC TTC GCC GGA GCG CTG GCG GAC CTC CCG GCG GCG GCC CCG 
Phe Thr Ala Phe Ala Gly Ala Leu Ala Asp Leu Pro Ala Ala Ala Pro 
1625 1630 1635 

ATG GCC GCG CTG CCG CAG CCC GGC TTT CTG CCG GGA GAA CGA GTC CCG 
Met Ala Ala Leu Pro Gin Pro Gly Phe Leu Pro Gly Glu Arg Val Pro 
1640 1645 1650 

GCC ACC CCG GAG GCA TTG TTC GAG GCC CAG GCG GAA GCG CTG CTC CGC 
Ala Thr Pro Glu Ala Leu Phe Glu Ala Gin Ala Glu Ala Leu Leu Arg 
1655 1660 1665 

TAC GCG GCC GGC CGG CCC TTC GTG CTG CTG GGG CAC TCC GCC GGC GCC 
Tyr Ala Ala Gly Arg Pro Phe Val Leu Leu Gly His Ser Ala Gly Ala 
1670 1675 1680 

AAC ATG GCC CAC GCC CTG ACC CGT CAT CTG GAG GCG AAC GGT GGC GGC 
Asn Met Ala His Ala Leu Thr Arg His Leu Glu Ala Asn Gly Gly Gly 
1685 1690 1695 1700 

CCC GCA GGG CTG GTG CTC ATG GAC ATC TAC ACC CCC GCC GAC CCC GGC 
Pro Ala Gly Leu Val Leu Met Asp lie Tyr Thr Pro Ala Asp Pro Gly 
1705 1710 1715 

GCG ATG GGC GTC TGG CGG AAC GAC ATG TTC CAG TGG GTC TGG CGG CGC 
Ala Met Gly Val Trp Arg Asn Asp Met Phe Gin Trp Val Trp Arg Arg 
1720 1725 1730 

TCG GAC ATC CCC CCG GAC GAC CAC CGC CTC ACG GCC ATG GGC GCC TAC 
Ser Asp He Pro Pro Asp Asp His Arg Leu Thr Ala Met Gly Ala Tyr 
1735 1740 1745 
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Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val Val Asp Ser Ala 
180 185 190 

Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys Glu Ser Leu Arg 



200 



205 



Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val Asn Leu He Leu 
210 215 220 

Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly Ala Leu Ser Pro 
225 230 235 240 

Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn Gly Tyr Val Arg 
245 250 255 

Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu Asp Ala Ala Leu 
260 265 270 



Ala Asp Gly Asp Arg Val Tyr Cys Val lie Lys Gly Gly Ala Val Asn 
275 280 285 

Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp Arg Glu Ala Gin 



295 300 
Glu Ala Val Leu Arg Gin Ala <iy r Arg Arg Ala Gly Val Ser Thr Gly 



310 



315 320 
Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr Arg Ala Gly Asp 



330 335 

Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly Ala Gly Ala Asp 
340 345 3 50 

Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val Lys Thr Asn Val 



360 



365 



Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu lie Lys Ala Thr 
J/u 375 380 

Leu cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu Asn Phe Ser Thr 
385 390 395 400 

Pro Asn Pro Asp He Pro Leu Asp Asp Leu Arg Leu Arg Val Gin Thr 



405 



410 



415 



Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro Arg Val Ala Gly 
420 425 ... * 



430 



Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His Leu Val II 



440 



e Ala 



445 



Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly Gly Ser Gly Ala 
" u 455 4 60 

Gly Ser Gly Ala Gly He Ser Ala Val Ser Gly Val Val Pro Val Val 
470 475 480 

Val ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu 
485 490 495 
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Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val 
500 505 510 

Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala 
515 520 525 

Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly 
530 535 540 

Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu Asp Gly Gly Val 
545 550 555 560 

Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly 
565 570 575 

Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly 
580 585 590 

Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser 
595 600 605 

Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val 
610 615 620 

His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu 
625 630 635 640 

Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser 
645 650 655 

Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val 
660 665 670 

Gly His Ser Gin Gly Glu lie Ala Ala Ala Thr Val Ala Gly Ala Leu 
675. 680 685 

Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He 
690 695 700 

Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro 
705 710 715 720 

Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val 
725 730 735 

Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg 
740 745 750 

Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala 
755 760 765 

Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp 
770 775 780 

Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser 
785 790 795 800 

Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu 
fi 05 810 815 
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Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val 
820 825 830 

Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg 
835 840 845 

Phe lie Glu Val Ser Ala His Pro Val Leu Val His Ala He Glu Gin 
850 855 860 

Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg 
865 870 875 880 

Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala 
885 890 895 

Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly 
900 905 910 

His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp 
915 920 925 

Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala Thr Gin Glu Lys 
930 935 940 

Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu Pro Ser Ser Gin 
945 950 955 960 

Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg His Thr Ala Val 
965 970 975 

Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro Asp Leu Ser Phe 
980 985 990 

Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu Leu Arg Asn Arg 
995 1000 1005 

Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr Leu Val Tyr Asp 
1010 1015 1020 

Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His Thr Glu Leu Leu 
1025 1030 1035 1040 

Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala Gin Lys Ser Phe 
1045 1050 1055 

Glu Ala Gin Glu Pro He Ala Val Val Gly Met Gly Cys Arg Phe Pro 
1060 1065 1070 

Gly Gly Val Gly Ser Pro Glu Ala Leu Trp Arg Leu Val Val Glu Gly 
1075 1080 1085 

Val Asp Ala Val Ser Pro Phe Pro Gly Asp Arg Gly Trp Asp Val Glu 
1090 1095 HOO 

Gly Leu Tyr Asp Pro Glu Pro Gly Val Ala Gly Lys Ser Tyr Val Arg 
1105 1110 1115 1120 

Glu Gly Gly Phe Leu His Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe 
1125 1130 H35 
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Gly lie Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu 
1140 1145 H50 

Leu Leu Glu Thr Ser Trp Glu Ala He Glu Arg Ala Gly He Asp Pro 
1155 H60 H65 

His Ser Leu His Gly Ser Arg Thr Gly Val Tyr Ala Gly Val Met Pro 
1170 1175 H80 

Gin Glu Tyr Gly Pro Arg Leu Ala Glu Gly Ala Glu Gly Ser Asp Gly 
1185 H90 1195 1200 

Tyr Leu Leu Thr Gly Thr Ser Gly Ser Val Val Ser Gly Arg Val Ala 
1205 1210 1215 

Tyr Thr Leu Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys 
1220 1225 1230 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Leu Arg Gly 
1235 1240 1245 

Gly Glu Cys Asp Met Ala Leu Ala Gly Gly Val Thr Val Met Ala Gly 
1250 1255 1260 

Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp 
1265 1270 1275 1280 

Gly Arg Cys Lys Ala Phe Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala 
1285 1290 1295 

Glu Gly Ala Gly Val Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
30 1 3 0 0 1305 1310 

Leu Gly His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin 
1315 1320 1325 



w 



15 



20 



25 



Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu 
1330 1335 1340 

Arg Val He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp 
1345 1350 1355 1360 

Val Asp Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro 
1365 1370 1375 

He Glu Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly 
1380 1385 1390 

Gly Arg Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala 
1395 1400 1405 

Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Met 
1410 1415 1420 

Arg Tyr Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg 
1425 1430 !435 1440 

His Val Asp Trp ser Ala Gly Gly Val Trp Leu Leu Thr Glu Ala Arg 
55 1445 1450 1455 
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Glu Trp Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
1460 1465 1470 

Gly Val Ser Gly Thr Asn Ala His Leu He Leu Glu Ala Pro Asp Thr 
1475 1480 1485 

Ala Glu Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu 
1490 1495 1500 

Ser Ala Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val 
1505 1510 1515 1520 

Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala 
1525 1530 1535 

Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr 
1540 1545 1550 

Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg 
1555 1560 1565 

Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asd 
i570 1575 1580 

Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly 
1585 1590 1595 16 5o 

Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin 
1605 1610 I615 

Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala 
16 20 1625 1630 

Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Glu Trp 
1635 1640 1645 

Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp 
1650 1655 1660 

Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr 
1665 1670 1675 1680 

Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin 
1635 1690 1695 

Gly Glu lie Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asd 
1700 1705 1710 

Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu 
1715 1720 1725 

Ala ?S A Arff Gly Ala Met Ala Ala Val Pro Leu Ala Gly Glu Val 
1730 1735 174Q 

Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn 
1745 17 50 1755 17 60 

Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly 
1765 1770 1775 
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Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu lie Pro 
1780 1785 1790 

Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu 
1795 1800 1805 

Leu Glu Arg Val Leu Ser Gly He Arg Pro Arg Ser Pro Arg Val Pro 
1810 1815 1820 

Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp 
1825 1830 1835 1840 

Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala 
1845 1850 1855 

Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe He Glu Val 
I860 1865 1870 

Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu Ala 
1875 1880 1885 

Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp 
1890 1895 1900 

Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly 
1905 1910 1915 1920 

Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr 
1925 1930 1935 

Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr Thr 
1940 1945 1950 

Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg 
1955 i960 1965 

Tyr Arg Val Thr Trp Lys Ala Leu Thr Glu Ser Ser Pro Val Arg Pro 
1970 1975 1980 

His Ser He Gly Arg Cys Leu Leu Val Ala Pro Pro Thr Thr Asp Gly 
1985 1990 1995 2000 

Glu Leu Leu Asp Gly Leu Thr Thr Val Leu Ser Glu Arg Gly Ala Ser 
2005 2010 2015 

Val Ala Arg Leu Glu Val Pro He Gly Ala Arg Arg Ala Glu Val Ala 
2020 2025 2030 

Glu Leu Leu Lys Pro Ser Met Glu Ser Ala Gly Glu Glu Asn Thr Thr 
2035 2040 2045 

Val Val Ser Leu Leu Gly Leu Val Pro Ser Thr Asp Ala Val Arg Thr 
2050 2055 2060 

Ser He Ala Leu Leu Gin Ala Val Ser Asp He Gly Val Pro Ala Ala 
2065 2070 2075 2080 

Arg Val Trp Ala Leu Thr Arg Arg Ala Val Ala Val Val Pro Gly Glu 
2085 2090 2095 
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Thr Pro Gin Asp Ala Gly Ala Gin Leu Trp Gly Phe Gly Arg Val Ala 
2100 2105 2110 

Ala Leu Glu Leu Pro Asp He Trp Gly Gly Leu He Asp Leu Pro Glu 
2115 2120 2125 

Thr Ala Glu Leu Thr Arg Thr Pro Glu Thr Ser Gin Pro Pro Gin Thr 
2130 2135 2140 

Pro Glu Arg Leu Pro Gin Thr Pro Asn Arg Arg Ala Leu Glu Leu Ala 
2145 2150 2155 2160 

Ala Ala Val Leu Ala Gly Arg Asp Gly Glu Asp Gin Val Ala Val Arg 
2165 2170 2175 

Ala Ser Gly He Tyr Gly Arg Arg Val Ser Arg Ala Ala Ala Ala Gly 
2180 2185 2190 

Ala Ala Ser Trp Gin Pro Ser Gly Thr Val Leu lie Thr Gly Gly Met 
2195 2200 2205 

Gly Ala He Gly Arg Arg Leu Ala Arg Arg Leu Ala Ala Glu Gly Ala 
2210 2215 2220 

Glu Arg Leu Val Leu Thr Ser Arg Arg Gly Pro Glu Ala Pro Gly Ala 
2225 2230 2235 2240 

Ala Glu Leu Ala Glu Glu Leu Arg Gly His Gly Cys Glu Val Val His 
2245 2250 2255 

Ala Ala Cys Asp Val Ala Glu Arg Asp Ala Leu Ala Ala Leu Val Thr 
2260 2265 2270 

Ala Tyr Pro Pro Asn Ala Val Phe His Thr Ala Gly He Leu Asp Asp 
2275 2280 2285 

Ala Val He Asp Thr Leu Ser Pro Glu Ser Phe Glu Thr Val Arg Gly 
2290 2295 2300 

Ala Lys Val Cys Gly Ala Glu Leu Leu His Gin Leu Thr Ala Asp He 
2305 2310 2315 2320 

Lys Gly Leu Asp Ala Phe Val Leu Phe Ser Ser Val Thr Gly Thr Trp 
2325 2330 2335 

Gly Asn Ala Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu Asp 
2340 2345 2350 

Ala Leu Ala Glu Arg Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val 
2355 2360 2365 

Ala Trp Gly Leu Trp Gly Gly Gly Gly Met Ala Ala Gly Ala Gly Glu 
2370 2375 2380 

Glu Ser Leu Ser Arg Arg Gly Leu Arg Ala Met Asp Pro Asp Ala Ala 
2385 2390 2395 2400 

Val Asp Ala Leu Leu Gly Ala Met Gly Arg Asn Asp Val Cys Val Thr 
2405 2410 2415 
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Val Val Asp Val Asp Trp Glu Arg Phe Ala Pro Ala Thr Asn Ala He 
2420 2425 2430 

Arg Pro Gly Arg Leu Phe Asp Thr Val Pro Glu Ala Arg Glu Ala Leu 
2435 2440 2445 

Thr Ala Ala Gly Thr Thr Ser Ala Thr Pro Asp Gly Ala Pro Glu Leu 
2450 2455 2460 

Ala Arg Arg Leu Ser Met Leu Asn Glu Thr Glu Arg Leu Arg Lys Leu 
2465 2470 2475 2480 

Val Glu Leu Val Arg Thr Glu Ala Ala Phe Val Leu Arg His Pro Asn 
2485 2490 2495 

Thr Asp Ala He Gly Ala Glu Arg Pro Phe Lys Ser Ala Gly Phe Asp 
2500 2505 2510 

Ser Leu Thr Ser Leu Glu Leu Arg Asn Arg Leu Asn Ala Gly Thr Gly 
2515 2520 2525 

Leu Lys Leu Pro Ala Thr Val He Phe Asp His Pro Ser Pro Thr Ala 
2530 2535 2540 

Leu Ala Arg Leu Leu Leu Asp Arg Leu Thr Gly Ala Gly Ala Pro Ala 
2545 2550 2555 2560 

Pro Ala Ala Asp Glu Pro Pro Leu Pro Val Ala Val Ala Asp Asp Asp 
2565 2570 2575 

Pro Val Val He Val Gly Met Ala Cys Arg Phe Pro Gly Gly Ala Gly 
2580 2585 2590 

Thr Pro Glu Ala Leu Trp Lys Leu Val Thr Glu Glu Arg Asp Val He 
2595 2600 2605 

Gly Ala Ala Pro Thr Asp Arg Gly Trp Asp Leu Asp Ser Val Tyr Asp 
2610 2615 2620 

Pro Glu Pro Gly Val Ala Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe 
2625 2630 2635 2640 

Leu His Asp Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro 
2645 2650 2655 

Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr 
2660 2665 2670 

Ser Trp Glu Ala He Glu Arg Ala Gly He Asp Pro His Ser Leu His 
2675 2680 2685 

Gly Ser Arg Thr Gly Val Tyr Val Gly Leu Thr His Gin Glu Tyr Ala 
2690 2695 2700 

Ser Arg Leu His Glu Ala Pro Glu Glu Tyr Glu Gly Tyr Leu Leu Thr 
2705 2710 2715 2720 

Gly Lys Ser Ala Ser Val Val Ser Gly Arg He Ser Tyr Thr Leu Gly 
2725 2730 2735 
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Leu Glu Gly Pro Ser Leu Ser He Asp Thr Ala Cys Ser Ser Ser Leu 
2740 2745 2750 

Val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Gly Gly Glu Cys Asp 
2755 2760 2765 

Met Ala Leu Ala Gly Gly Val Thr Val Met Ala Ala Pro Gly Leu Phe 
2770 2775 2780 

Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys 
2785 2790 2795 2800 

Ala Phe Ala Asp Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly 
2805 2810 2815 

Val Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Leu Gly His Pro 
2820 2825 2830 

Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin Asp Gly Ala Ser 
2835 2840 2 845 

Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu Arg Val He Ara 
2850 2855 2860 

Gin Ala Leu Ala Asn Ala Arg Leu Thr Val Ala Asp Val Asp Val Val 
2865 2870 2875 2 880 

Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro He Glu Ala Gin 
2885 2890 2895 

Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Ala Glu Cys Pro Val 
2900 2905 2910 

^ LSU o^ Ser LeU Lys Ser Asn Ile Glv Hi * A1 » Gin Ala Ala Ala 
2915 2920 2925 

2930^* 2935 Met ^ ^ 2940^ ^ 

Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg His Val Asp Trp 
2945 2950 2955 2960 

Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly 
2965 2970 2975 

Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Glv 
238° 2985 2 990 

^ ooL HiS ^ 116 LSU Glu Ala Pro Glu Al a Leu Glu Ala Leu 

2995 3000 3005 

Glu Ala Thr Asp Ala Pro Glu Ala Pro Glu Ala Pro Glu Ala Pro Asp 
3010 3015 3020 

Val Thr Asp Val Thr Glu Ala Leu Glu Ala Pro Asp Ala Thr Glu Ala 
3025 3030 3035 3040 

Glu Gly Ala Lys Ala Pro Gly Ser Pro Glu Glu Ala Gin Pro Ala Val 
3045 3050 30S5 
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Gly Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg 
3060 3065 3070 

Glu Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly 
3075 3080 3085 

Leu Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg Phe Gly Tyr 
3090 3095 3100 

Arg Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu 
3 1°5 3110 3115 3120 

Arg Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala 
3125 3130 3135 

Val Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val Leu Val 
3140 3145 3150 

Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu 
3155 3160 3165 

Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala 
3170 3175 3180 

Leu Ser Val His Val Glu Trp Asp Leu Leu Glu Val Val Ser Gly Gly 
3185 3190 3195 3200 

Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val 
3205 3210 3215 

Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala 
3220 3225 3230 

Ala Val Val Gly His Ser Gin Gly Glu lie Ala Ala Ala Thr Val Ala 
3235 3240 3245 

Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala 
3250 3255 3260 

Gly Leu lie Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val 
3265 3270 3275 3280 

Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly 
3285 3290 3295 

Val Gin Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly 
3300 3305 3310 

Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly 
3315 3320 3325 

Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His 
3330 3335 3340 

Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Arg 
3345 33 *0 3355 3360 

Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin 
3365 3370 3375 
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Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg 
3380 3385 3390 

Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Gin Gly 
3395 3400 3405 

His Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala 
3410 3415 3420 

He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly 
3425 3430 3435 3440 

Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr 
3445 3450 3455 

Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu 
3460 3465 3470 

Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His 
3475 3480 3485 

His Tyr Trp Ala Val Thr Ser Pro Ala Gly Val Gly Asp Ala Ala Ala 
3490 3495 3500 

Gly Arg Phe Gly Met Thr Trp Glu Asp His Pro Phe Leu Arg Gly Gly 
3505 3510 3515 3520 

Leu Pro Leu Ala Asp Ser Gly Glu Arg Val Phe Ala Gly Arg Leu Ala 
3525 3530 3535 

Gly Ser Glu His Asp Trp Leu Thr Asp His Ala Val Ser Gly Val Thr 
3540 3545 3550 

Leu Leu Pro Gly Thr Ala Phe Val Glu Phe Ala Leu His Ala Gly Ala 
3555 3560 3565 

Ala Thr Gly Cys Gly Arg Leu Glu Glu Leu Ser Val Glu Ala Pro Leu 
3570 3575 3580 

Val Leu Pro Ala Ala Gly Gly Val Arg Val Gin Met Arg Val Ser Ala 
3585 3590 3595 3600 

Ala Asp Glu Ser Gly Arg Arg Arg Val Ala He His Ser Ala Pro Glu 
3605 3610 3615 

Ala Ala Val His Ser Ala Ala Glu Gly Gly Asp Ser Ala Gly Val Trp 
3620 3625 3630 

Thr Arg His Gly Glu Gly Thr Leu Val Pro Asp Pro Glu Pro Thr Pro 
3635 3640 3645 

Pro Asp Ala Asp Trp Ala Arg Ala Trp Pro Pro Ala Gly Glu Arg Val 
3650 3655 3660 

Glu Pro Ala Glu Leu Tyr Glu Arg Phe Gly Ala Leu Gly Tyr Glu Tyr 
3665 3670 3675 3680 

Gly Glu Ala Phe Ala Gly Val Arg Ala Val Trp Arg Gin Pro Asp Ala 
3685 3690 3695 
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Leu Leu Ala Glu Val Leu Leu Pro Asp Arg Ala Ser Thr Gly Ala Gly 
3700 3705 3710 

Arg Phe Gly Val His Pro Ala Leu Leu Asp Ala Ala Leu Gin Pro Trp 
3715 3720 3725 

lie Ala Gly Gly Leu Leu Glu Val Pro Glu Asp Ala Val Leu Leu Pro 
3730 3735 3740 

Phe Ala Trp Gin Gly Val Ser Leu Tyr Ala Thr Gly Ala Gly Ala Leu 
3745 3750 3755 3760 

Arg Val Arg Leu Thr Lys Ala Gly Asp Gly Ala Val Ser Leu Gin Ala 
3765 3770 3775 

Ala Asp Thr Ser Gly Ala Ala Val Leu Ser Leu Gly Ala Leu Val Met 
3780 3785 3790 

Arg Pro Leu Ala Arg Arg Lys Leu Asp Val Leu Leu Gly Thr Asp Ala 
3795 3800 3805 

Gly Glu Arg Ser Leu Tyr Arg Val Glu Trp Gin Pro Arg Leu Leu Pro 
3810 3815 3820 

Ala Gly Pro Pro Arg Ser Trp Ala Val Leu Gly Pro Asp Ala Asp Arg 
3825 3830 3835 3840 

Leu Ala Gly Thr Pro Gly Leu Gly Asp Gin Pro Asp Gly Gly Pro Thr 
3845 3850 3855 

Ala Leu Tyr Pro Glu Val Arg Ala Leu Arg Lys Ala Leu Ala Ala Gly 
3860 3865 3870 

Ala Pro Arg Pro Glu Ala Val Val Leu Pro Val Leu Ser Gly Ala Gly 
3875 3880 3885 

Ala Thr Pro Glu Ser Val Arg Gin Thr Thr Glu Arg Cys Leu Thr Ala 
3890 3895 3900 

Leu Gin Asp Trp Leu Asp Ala Glu Glu Leu Val Asp Thr Pro Leu lie 
3905 3910 3915 3920 

Val Leu Thr Arg Gly Ala Val Ala Ala Val Pro Gly Glu Glu He Gly 
3925 3930 3935 

Asp Leu Ala Cys Ala Gly Val Trp Gly Leu Val Arg Ser Ala Arg Ser 
3940 3945 3950 

Glu His Pro Gly Arg Phe Ala Leu Val Asp Thr Asp Gly His Pro Asp 
3955 3960 3965 

Asp Arg Thr Ala Leu Pro Leu Ala Leu Arg Ala Val Leu Asp Gly Ala 
3970 3975 3980 

Gly Gin Leu Ser Leu Arg Ala Gly Thr Ala Arg Thr Pro Val Leu Leu 
3985 3990 3995 4000 

Arg Ala Gly Thr Pro Glu Glu Gin Arg Gly Pro Ala Phe Asp Pro Ala 
4005 4010 4015 
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Gly Thr Val Leu Val Thr Gly Ala Thr Gly Thr Leu Gly Arg Leu Leu 
4020 4025 4030 

Ala Arg His Leu Ala Ala Glu His Gly Val Arg His Leu Leu Leu Leu 
4035 4040 4045 

Ser Arg Gly Gly Arg Ala Ala Glu Gly Ala Asp Glu Leu Ala Ala Glu 
4050 4055 4060 

Leu Ala Gly Leu Glu Ala Glu Pro Cys Phe Ala Ala Cys Asp Ala Ala 
4065 4070 4075 4080 

Asp Arg Glu Ala Leu Ala Arg Val Leu Ala Glu Val Pro Ala Asp Arg 
4085 4090 4095 

Pro Leu Thr Gly Val He His Ala Ala. Gly Val Leu Asp Asp Gly Thr 
4100 4105 4110 

Leu Asp Ala Leu Thr Pro Glu Arg He Gly Thr Val Met Arg Pro Lys 
4115 4120 4125 

Ala Asp Ala Ala Leu Asn Leu His Glu Leu Thr Arg Thr Ser Pro Leu 
4130 4135 4140 

Ser Val Phe Ala Val Phe Ser Gly Ala Ala Gly He Leu Gly Arg Pro 
4145 4150 4155 4160 

Gly Gin Ala Asn Tyr Ala Ala Ala Asn Thr Phe Leu Asp Ala Leu Ala 
4165 4170 4175 

Gin His Arg Arg Ala His Gly Leu Pro Ala Val Ser Leu Ala Trp Gly 
4180 4185 4190 

Leu Trp Gly Gly Ala Thr Gly Met Thr Gly His Leu Ser Gly Thr Asp 
41 ^5 4200 4205 

Leu Arg Arg Met Arg Arg Ser Gly He Ala Pro Met Thr His Asp Gin 
421° 4215 4220 

Gly Leu Ala Leu Phe Asp Arg Ala Leu Ala Ala Ser Ala Glu Asp Pro 
4225 4230 4235 4 240 

Leu Leu Val Pro Met Arg Leu Asp Leu Ala Ala Leu Val Arg Glu Arg 
4245 4250 4255 

Ala Glu His Gly Pro Asp Ala Val Pro Gly Pro Leu Leu Gly Leu Leu 
4260 4265 4270 

Pro Ala Arg Ala Ala Val Arg Gin Ala Ala Ala Pro Val Arg Gly Gly 
427 * 4280 4285 

Ala Pro Ala Pro Ala Gly Gly Glu Gly Thr Ala Glu Arg Leu Ala Gly 
4290 4295 4 300 

Leu Gly Glu Glu Ala Arg Leu Arg Glu Leu Val Arg Leu Val Arg Ala 
4305 4310 4315 4320 

Glu* Val Ser Gly Val Leu Gly Tyr Ser Gly Pro Asp Ala Val Glu Pro 
4325 4330 4335 
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Gly Arg Pro Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu 
4340 4345 4350 

Leu Arg Asn Arg Leu Gly Ala Ala Thr Gly Leu Arg Leu Pro Thr Ala 
4355 4360 4365 

Leu Val Phe Asp Arg Pro Thr Ser Gin Ala Val Ala Glu Tyr Leu Ala 
4370 4375 4380 

Ala Glu Leu Ala Gly Pro Arg Asp Gly Gly Asp Thr Ala Ala Ala Ala 
4385 4390 4395 4400 

Phe Glu Gly Leu Glu Ala Leu Ala Ala Ala Val Gly Ala Leu Ala Glu 
4405 4410 4415 

Asp Asp Leu Arg Arg Asp Val Leu Arg Arg Arg Leu Thr Glu Leu Ala 
4420 4425 4430 

Ala Ala Leu Thr Pro Gin Gly Arg Asn Pro Ser Ala Pro Ala Pro Ala 
4435 4440 4445 

Pro Ser Asp Leu Asp Glu Arg Leu Asp Ser Ala Asn Asp Asp Asp Leu 
4450 4455 4460 

Phe Ala Phe He Glu Glu Gin Leu * 
4465 4470 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1865 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Met Thr Ala Glu Leu Val Ala Thr Arg Lys Arg Leu Gly Ala Leu Glu 
1 5 10 15 

Glu Arg Ala Arg Glu Pro He Ala Val Val Ala Met Ser Cys Arg Tyr 
20 25 30 

Pro Gly Gly Val Thr Thr Pro Glu Asp Leu Trp Arg Leu Leu Ala Asp 
35 40 45 

Glu Arg Asp Ala Val Ser Gly Leu Pro Arg Asp Arg Gly Trp Asp Leu 
50 55 60 

Asp Ala Leu Tyr Asp Pro Asp Gly Gly Pro Gly Thr Ser Tyr Ala Arg 
65 70 75 80 

Glu Gly Gly Phe Leu Ser His Cys Ala Gly Phe Asp Ala Glu Phe Phe 
85 90 95 

Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu 
100 105 HO 
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Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly Val Thr Ala 
115 120 125 

Asp Arg Ala Arg Gly Ser Arg Thr Gly Val Tyr Ala Gly Val Met Tyr 
130 135 140 

Asp Asp Tyr Gly Ala Arg Val Leu Tyr Gly Ala Gly Ala Gly Pro Pro 
145 150 155 160 

Glu Asp Leu Glu Gly Tyr Leu Val Asn Gly Ser Ala Gly Ser lie Ala 
165 170 175 

Ser Gly Arg Val Ser Tyr Thr Phe Gly Leu Arg Gly Pro Ala Val Thr 
180 185 190 

Val Asn Thr Ala Cys Ser Ser Ser Leu Val Ser Leu His Leu Ala Val 
195 200 205 

Arg Ala Leu Arg Asn Gly Glu Cys Asp Met Ala Leu Ala Gly Gly Ala 
210 215 220 

Thr Val Leu Ser Thr Pro Thr Val Leu Val Asp Phe Ser Arg Gin Arg 
225 230 235 240 

Gly Leu Ala Pro Asp Gly Arg Cys Lys Ala Phe Ala Asp Ser Ala Asp 
245 250 255 

Gly Thr Ser Trp Ala Glu Gly Ala Gly Met Leu Leu Leu Gin Arg Leu 
260 265 270 

Ser Asp Ala Arg Arg Glu Gly Arg Pro Val Leu Ala Val lie Arg Gly 
275 280 285 

Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn 
290 295 300 

Gly Arg Ala Gin Arg Gin Val lie Glu Asp Ala Leu Arg Asp Ala Gly 
305 310 315 320 

Val Gly Pro Asp Gin Val Asp Ala Val Glu Ala His Gly Thr Gly Thr 
325 330 335 

Glu Leu Gly Asp Pro lie Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly 
340 345 350 

Thr Ala Arg Thr Ala Glu Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser 
355 360 365 

Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys 
370 375 380 

Met Val Leu Ala Met Arg His Gly Arg Leu Pro Arg Thr Leu His Val 
385 390 395 400 

Asp Arg Pro Thr Thr Arg Val Asp Trp Glu Lys Gly Gly Val Arg Leu 
405 410 415 



Leu Thr Glu Pro Val Pro Trp Pro Gly Glu Ala Gly Glu Pro Arg Arg 
420 425 430 
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Ala Gly Val Ser Ser Phe Gly Ala Ser Gly Thr Asn Ala His Val Val 
435 440 445 

Leu Glu Ser Val Pro Ala Gly Glu Pro Pro Ala Ala Gly Arg Pro Glu 
450 455 460 

Asp Thr Gly Gly Ala Trp Thr Val Ser Gly Arg Gly Pro Ala Ala Leu 
465 470 475 480 

Arg Ala Gin Ala Ala Arg Leu Tyr Asp Ala Leu Thr Gly Thr Gly Thr 
485 490 495 

Gly Thr Gly Gin Gly Ala Gly Gin Gly Ala Gly Pro Gly Thr Ala Glu 
500 505 510 

Val Ala Gly Ala Leu Ala His Ala Arg Thr Ala Phe Arg His Arg Ala 
515 520 525 

Val Val Leu Gly Gly Asn Arg Ala Glu Leu Leu Ala Gly Leu Arg Glu 
530 535 540 

Leu Ala Glu Glu Glu His Pro Gly Pro Arg Val Val Thr Gly Thr Ala 
545 550 555 560 

Pro Ala Thr Glu Arg Arg Thr Ala Phe Leu Phe Ser Gly Gin Gly Ser 
565 570 575 

Gin Arg Ala Gly Ser Gly Arg Gly Leu Tyr Arg Arg His Pro Val Phe 
580 585 590 

Ala Arg Ala Leu Asp Glu Val Cys Ala Ala Leu Glu Pro His Leu His 
595 600 605 

Arg Pro Leu Arg Asp Leu Met Phe Ala Glu Pro Gly Ser Pro Glu Ala 
610 615 620 

Glu Pro Leu Asp Arg Thr Glu Phe Thr Gin Pro Ala Leu Phe Ala Leu 
625 630 635 640 

Gin Thr Ala Leu Phe Arg Leu Ala Glu His His Gly Leu Arg Ala Glu 
645 650 655 

Ala Leu Cys Gly His Ser Val Gly Glu He Ala Ala Ala His Ala Ala 

660 665 670 

Gly Val Leu Thr Leu Pro Asp Ala Ala Arg Leu Val Ala Ala Arg Gly 
675 680 685 

Arg Leu Met Gin Ala Leu Pro Ala Gly Gly Ala Met Ala Ala Leu Arg 
690 695 700 

Ala Thr Ala Glu Glu He Ala Pro Leu Leu Glu Arg Arg Ala Gly Glu 
705 710 715 720 

Leu Ala Leu Ala Ala Val Asn Gly Pro Ser Ser Val Val Val Ser Gly 

725 730 735 



Asp Glu Ala Ala Val Leu Glu Leu Leu Glu Gin Trp Arg Ala Glu Gly 
740 745 750 
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Arg Glu Ala Arg Arg Leu Ala Val Ser His Ala Phe His Ser Pro Arg 
755 760 765 

Met Asp Gly Met Leu Thr Gin Phe Asp Arg Val Ala Arg Thr Leu Thr 
770 775 780 

Phe Ala Pro Pro Thr He Pro Leu Val Ser Thr Leu Thr Gly Thr Pro 
785 790 795 800 

Val Thr Glu Glu Thr Leu Cys Thr Ala Asp His Trp Val Arg Gin Ala 
805 810 815 

Arg Glu Pro Val Arg Phe Leu Asp Ala Met Arg Thr Leu Arg Ala Asp 
820 825 830 

Gly He Asp Thr Phe Val Glu Leu Gly Pro Asp Gly Val Leu Ser Ala 
835 840 845 

Met Ala Arg Asp Cys Ala Asp Asp Arg Pro Asp Gly Asp Thr Thr Gly 
850 855 860 

Ala Gly Asp Gly Glu Thr Pro Asp Pro Leu Leu Thr Leu Pro Leu Leu 
865 870 875 880 

Arg Arg Ser Val Pro Glu Thr Gly Asp Ala Glu His Pro Gly Gly Phe 
885 890 895 

Glu Arg Ala Leu Ala Thr Ala Tyr Ala His Gly Val Pro Leu Arg Leu 
900 90S 910 

Ala Pro Ala Pro Asp Ala Ala Ser Leu Ala Val Ala Ala Glu Leu Pro 
915 920 925 

Thr Tyr Ala Phe Gin Arg Thr His Tyr Trp Leu Asp Ala Pro Ala Ala 
930 935 940 

Pro Ala Ala Leu Pro Ala Gly Leu Asp Asp Ala Gly His Pro Leu Leu 
945 950 955 960 

Ser Ala Ala Leu Asp Leu Pro Gly Gly Arg Gly Thr Val Trp Thr Gly 
965 970 975 

Ala Leu Ser Ala Ala Thr Leu Pro Trp Ala Ala Asp His Ser Val His 
980 985 990 

Gly Arg Thr Val Leu Pro Gly Thr Ala Leu Leu Asp Leu Ala Leu His 
995 1000 1005 

Ala Ala Pro Arg Val Gly Glu Leu Thr Phe Glu Ala Pro Leu Val Leu 
1010 1015 1020 

Pro Glu Asp Gly Glu Val Arg Leu Arg Val Val Leu Ala Glu Pro Asp 
1025 1030 1035 1040 

Ala Ser Gly Val Arg Glu Leu Ser Val His Ser Ala Gly Glu Asp Gly 
1045 1050 1055 

Gly Trp Thr Arg His Ala Thr Ala Val Leu Asp Thr Gly Thr Thr Thr 
1060 1065 1070 
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Ala Gly Glu Pro Ala Gly Ala Pro Pro Ala Ala Tip Pro Pro Gly Asp 
1075 1080 1085 

Ala Glu Pro Leu Asp Leu Ala Ala Glu Tyr Glu Arg Phe Ala Asp Ala 
1090 1095 1100 

Gly He Gly Tyr Gly Pro Ala Phe Arg Gly Leu Arg Ser Ala Trp Arg 
1105 1110 1115 1120 

Asp Gly Asp Ala He Leu Ala Asp Val Arg Leu Pro Gly Glu Leu Ala 
1125 1130 1135 

Gly Glu Ala Asp Arg Tyr Gly He His Pro Ala Leu Leu Asp Ala Ala 
1140 1145 1150 

Leu His Thr Ala Ala Ala Ala Leu Gly Gly Ala His Gly Met Leu Pro 
1155 1160 1165 

Phe Thr Trp Asn Gly Val Thr Leu His Ala Arg Gly Ala His Ala He 
1170 1175 1180 

Arg Val Arg Leu Thr Pro Ala Gly Pro Asp Ala Val Ala Val Thr Ala 
1185 1190 1195 1200 

Val Asp Pro Ala Gly Arg Pro Val Phe Thr Ala Ala Ser Leu Thr Leu 
1205 1210 1215 

Arg Pro Val Thr Thr Gly Gin Leu Thr Ala Ala Glu Ala Ala Arg Ala 
1220 1225 1230 

Pro Leu Tyr Arg Val Arg Trp Thr Gly Leu Pro Asp Thr Gly Thr Ala 
1235 1240 1245 

Arg Asp His Thr Trp Ala Val Ala Gly Gly Pro Gly Asp Leu Leu Pro 
1250 1255 1260 

Gly Glu Thr Pro His His Pro Asp Leu Ala Ser Ala Leu Ala Asp Thr 
1265 1270 1275 1280 

Gly Thr Ala Pro Phe Arg Val Leu Ala Asp Leu Arg Gly Tyr Gly Thr 
1285 1290 1295 

Ala Thr Pro Arg Glu Leu Ala Ser Gin Ala Leu Ala Leu Val Gin Gin 
1300 1305 1310 

Trp Ala Asp Ala Ala Glu Ala Ala Glu Gly Arg Leu Val Leu Val Thr 
1315 1320 1325 

Arg Arg Ala Val Asp He Gly Asp Gly Val Thr Asp Pro Ala Ala Ala 
1330 1335 1340 

Thr Val Trp Gly Leu Val Arg Ala Ala Gin Ser Glu His Pro Gly Cys 
1345 1350 1355 1360 

Phe Ala Leu Leu Asp Thr Asp Asp Ser Pro Arg Ser Arg Gin Leu Leu 
1365 1370 1375 

Pro Arg Val Ala Gly Thr Ala Glu Gin Leu Ala Leu Arg Asp Gly Thr 
1380 1385 1390 
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Leu Leu Ala Pro Ser Leu Thr Arg Ala Thr Leu Pro Ala Gly Ala Arg 
1395 1400 1405 

Leu Pro Ala Leu Asp Gly Thr Val Leu He Thr Gly Gly Thr Gly Ser 
1410 1415 1420 

Leu Gly Ala Glu Ala Ala Arg His Leu Val Thr Arg His Gly Ala Arg 
1425 1430 1435 1440 

Arg Leu Leu Leu Thr Ser Arg Ser Gly Pro Gin Ala Pro Gly Ala Ala 
1445 1450 1455 

Glu Leu Val Ala Glu Leu Ala Ala Leu Gly Ala His Ala Asp Val Ala 
1460 1465 1470 

Ala Cys Asp Val Ala Asp Arg Ala Ala Leu Arg Ala Leu Leu Asp Arg 
1475 1480 1485 

Val Pro Ala Gly His Pro Leu Thr Ala Val Leu His Thr Ala Gly Val 
1490 1495 1500 

Leu Asp Asp Gly Val Leu Thr Ala Gin Thr Pro Gin Arg Leu Ala Ala 
1505 1510 1515 1520 

Val Leu Arg Pro Lys Ala Asp Ala Val Arg Asn Leu His Glu Leu Thr 
1525 1530 1535 

Gin Gly His Ala Leu Ser Ala Phe He Leu Tyr Ser Ser Ala Ala Gly 
1540 1545 1550 

Val Leu Gly Ser Ala Gly Gin Ser Gly Tyr Ala Ala Ala Asn Ala Tyr 
1555 1560 1565 

Leu Asp Ser Phe Ala Val Trp Arg Arg Ser Arg Gly Leu Pro Ala Val 
1570 1575 1580 

Ser Leu Gly Trp Gly Pro Trp Asp Gly Gly Gly Met Ala Ser Gly Leu 
1585 1590 1595 1600 

Gly Gly Thr Asp Thr Ala Arg Leu Arg Arg Ser Gly He Ala Pro Leu 
1605 1610 1615 

Ser Arg Ala Glu Gly Leu Ala Ala Leu Asp Ala Ala Leu Ala Ala Gly 
1620 1625 1630 

Gly Asp Asp Thr Ala Pro Ala His Leu Leu Pro He Arg Val Asp Ala 
1635 1640 1645 

Val Thr Leu Arg Gly Ala Asp Thr Val Pro Ala Val Leu Arg Asp Leu 
1650 1655 1660 

Ala Gly Thr Ala Pro Ser Ala Ala Glu Arg Pro Pro Gly Thr Pro Glu 
1665 1670 1675 1680 

Asp Thr Asn Ala Pro Leu Ala Asp Val Thr Gin Leu His Gly Arg Glu 
1685 1690 1695 

Arg Lys Glu Ala Leu Thr Gly Phe Val Arg Ala Gin Val Ala Ala Val 
1700 1705 1710 
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Leu Gly His Pro Thr Ser Asp Thr lie Asp Val Arg Arg Ser Phe Lys 
1715 1720 1725 

Glu Ala Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg Asn Arg Leu 
1730 1735 1740 

Arg Ala Ala Thr Gly Leu Lys Leu Pro Ala Thr Leu Val Phe Asp His 
1745 1750 1755 1760 

Pro Thr Pro Leu Ala Leu Ala Gly Phe Leu His Arg Glu Leu Pro Gly 
1765 1770 1775 

Ala Glu Ala Ser Leu Met Ser Ala lie Asp Thr Leu Arg His Arg Leu 
1780 1785 1790 

Arg Asp Ala Leu Ala Asp Asp Ala Ala Asp Asp Ala Leu Arg Asp Gin 
1795 1800 1805 

lie Thr Arg Arg Leu Glu Thr Leu Leu Ala Gly lie Ala Arg Thr Glu 
1810 1815 1820 

Glu Pro Ala Pro Ala Thr Ala Ala Ala Asp Asp Gly Ser Gly Ala Gly 
1825 1830 1835 1840 

Asp Val Ala Glu Arg Leu Ser Thr Ala Ser Asp Asp Glu Leu Phe Glu 
1845 1850 1855 

Leu Leu Asp Ser Gly Phe Thr Pro * 

1860 1865 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3730 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Ser Thr Glu Asn Ser Thr Asn Val Pro Ala Ser Glu Asp Lys Leu 
15 10 15 

Arg Ala Tyr Leu Arg Arg Ala Met Ala Asp Leu His Glu Ser Arg Glu 
20 25 30 

Arg Leu Arg Ala Thr Glu Ala Arg Ala Gin Glu Pro He Ala Val Val 
35 40 45 

Gly Met Gly Cys Arg Phe Pro Gly Gly Val Gly Ser Pro Glu Ala Leu 
50 55 60 

Trp Arg Leu Val Val Glu Gly Val Asp Ala Val Ser Pro Phe Pro Gly 
65 70 75 80 

Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly Val 
85 90 95 
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Ala Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala 
100 105 110 

Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Val Ala 
115 120 125 

Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala He 
130 135 140 

Glu Arg Ala Gly He Asp Pro His Ser Leu His Gly Ser Arg Thr Gly 
"5 150 155 160 

Val Tyr Ala Gly Val Met Tyr His Asp Tyr Gly Thr Gly Gin Thr Ser 
165 170 175 

Ala Thr Asp Thr Ser Gly Tyr Ser Gly Thr Gly Thr Ser Gly Ser Val 
180 185 190 

Val Ser Gly Arg Val Ala Tyr Thr Leu Gly Leu Glu Gly Pro Ala Val 
195 200 205 

Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala 
210 215 220 

Val Gin Ala Leu Arg Gly Gly Glu Cys Asp Met Ala Leu Ala Gly Gly 
225 230 235 240 

Val Thr Val Met Ala Gly Pro Gly Met Phe Val Glu Phe Ser Arg Gin 
245 250 255 

Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Asp Gly Ala 
260 265 270 

Asp Gly Thr Ala Trp Ala Glu Gly Ala Gly Val Val Leu Val Glu Arg 
275 280 285 

Leu Ser Asp Ala Arg Arg Leu Gly His Pro Val Leu Ala Val Val Cys 
290 295 300 

Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
305 310 315 320 

Ser Gly Pro Ser Gin Glu Arg Val He Arg Gin Ala Leu Ala Asn Ala 
325 330 335 

Arg Leu Thr Val Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly 
340 345 350 

Thr Arg Leu Gly Asp Pro He Glu Ala Gin Ala Leu Leu Gly Thr Tyr 
355 360 365 

Gly Arg Asp Arg Asp Gly Gly Arg Pro Val Trp Leu Gly Ser Leu Lys 
370 375 380 

Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Ala Gly Val He 
385 390 395 400 

Lys Met Val Leu Ala Met Arg Tyr Gly Trp Leu Pro Arg Thr Leu His 
405 410 415 
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Val Asp Glu Pro Ser Arg His Val Asp Trp Ser Ala Gly Gly Val Trp 

420 425 430 

Leu Leu Thr Glu Ala Arg Glu Trp Pro Gly Val Asp Arg Pro Arg Arg 

435 440 445 



10 



Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Leu He 
450 455 460 

Leu Glu Ala Pro Asp Thr Ala Glu Ala Glu Ser Ala Thr Thr Pro Val 

465 470 475 480 



15 



Arg Ser Glu Val Ser Glu Ser Ala Ala Val Phe Asp Ala Arg Ser Gly 
485 490 495 

Val Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu 
500 505 510 



20 



Ala Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu 
515 520 525 



25 



30 



35 



Ala Asp Val Ala Val Thr Met Ala Gly Arg Ser Arg Phe Gly TVr Arg 
530 535 540 

Ala Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arc 
545 550 555 560 

Ala Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val 
565 570 575 

Val Asp Pro Glu Thr Gly Ser Gly Gly Gly Gly Val Val Leu Val Phe 
580 585 590 

Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly 
595 600 605 

Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu 
610 615 620 



40 



Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala 
625 «0 635 640 

Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met 

645 650 655 



45 



Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala 
660 665 670 



so 



Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val Ala Gly 

675 680 • 685 

Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly 
690 695 700 



55 



Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro 
705 710 715 720 

Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val 

725 730 735 
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Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp 
740 745 750 

Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val 
755 760 765 

Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val 
770 775 780 

Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He Arg Pro 
785 790 795 800 

Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro 
805 810 815 

Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn 
820 825 830 

Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His 
835 840 845 

Arg Arg Phe lie Glu Val Ser Ala His Pro Val Leu Val His Ala He 
850 855 860 

Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr 
865 870 875 880 

Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala 
885 890 895 

Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro 
900 905 910 

Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His 
915 920 925 

Tyr Trp Leu Asp Thr Thr Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser 
930 935 940 

Pro Thr Asp Ala Gin Asn Pro Ala Asp Ala Leu Pro Tyr Lys Val Ser 
945 950 955 960 

Trp Lys Arg Leu Arg Asp Gin Asp Ser Leu Thr Ala Arg Leu Asp Gly 
965 970 975 

Arg Trp Leu Leu Val Val Pro Glu Ala Ser Ala Asp Pro Ser Val Ala 
980 985 990 

Glu Gly Val Ala Arg Glu Leu Thr Ala Arg Gly Ala Thr Val Glu Ser 
995 1000 1005 

Leu Thr Val Glu Pro Gly Ala Asp Arg Ser Arg Leu Arg Gly Leu Leu 
1010 1015 1020 

Val Asp Ala Thr Glu Arg Asp Glu Ala Gly Pro Leu Arg Gly He Val 
1025 1030 1035 1040 

Ser Leu Leu Ala Leu Ala Gly Asp His Ala Gly Ala Asp Gly Ala Arg 
1045 1050 1055 
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10 



15 



20 



Pro Val Val Pro Ala Gly Leu Ala Ala Ser Leu Ala Leu He Gin Ala 
1060 1065 1070 

Ala Gly Asp Ala Gly Thr Glu Ala Gly Leu Trp Ala Val Thr Arg Gly 
1075 1080 1085 

Ala Val Ala Ala Val Pro Gly Asp Val Pro Ala Pro Ser Gin Ala Leu 
1090 1095 1100 

Leu Trp Gly Phe Gly Arg Val Ala Gly He Glu Leu Pro His Cys Trp 
1105 1110 1115 1120 

Gly Gly Leu Leu Asp Leu Pro Thr Gly Pro Gly Asp Ser Gly Phe Arg 
1125 H30 1135 

Gin Leu Ala Ala Thr Leu Ala Gly Arg Pro Ala Glu Asp Gin Val Ala 
1140 1145 1150 

Leu Arg Ala Ser Gly Ala Tyr Gly Arg Arg Leu Val Arg Ala Ser Ala 
1155 1160 1165 

Ala Gly Gly Ala Asp Gly Trp Arg Pro Arg Gly Thr Val Leu Val Val 
1170 1175 1180 

Gly Asp Thr Ala Glu Val Ala Gly Pro Leu Val Arg Trp Leu Leu Gly 
25 H85 1190 1195 1200 

Asn Gly Ala Arg Arg Val Thr Leu Ser Gly Leu Ser Gly Pro Leu Pro 
1205 1210 1215 

30 Glu Glu Leu Ala Asp Val Ala Ala Arg Val Thr Val Ala Pro Cys Asp 

1220 1225 1230 

Pro Ala Asp Arg Pro Ala Leu Arg Thr Leu Leu Ala Glu Gin Ala Pro 
1235 '1240 1245 

55 Thr Ala Val Leu Val Ala Pro Pro Ala Val Pro Pro Thr Pro Leu Ala 

1250 1255 1260 

Glu Met Thr Ala Glu Ala Leu Ala He Ala Leu Ser Ala Lys Thr Gly 
1265 1270 1275 1280 

40 

Leu Val Asp Arg Leu Asp Ser Leu Leu Asp Glu Pro Asp Pro Leu Leu 
1285 1290 1295 

Glu Asp Gly Glu Leu Asp Ala Phe Val Val Phe Ser Ser Val Ala Gly 
45 1300 1305 1310 

Val Trp Gly Gly Ala Gly Gin Gly Gly Tyr Ala Ala Gly Thr Ala Tyr 
1315 1320 1325 

Leu Asp Ala Leu Ala Glu Cys Arg Arg Ala Gly Gly Leu Pro Val Thr 
50 1330 1335 1340 

Ser Val Ala Trp Thr Pro Trp Leu Gly Thr Pro Ala Ala Asp Ser Leu 
1345 1350 1355 1360 

55 Gly Glu Gin Met Ser Arg Ala Gly He Thr Pro Leu Asp Pro Ala Ala 

1365 1370 1375 
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Ser Leu Asp Ala Leu Ala Arg Ala Val Gly Arg Arg Ala Gly Cys Val 
1380 1385 1390 

Thr Val Ala Asp He Asp Trp Glu Arg Phe Ala Ser Ala Tyr Thr Ala 
1395 1400 1405 

Thr Arg Pro Thr Pro Met Phe Asp Glu Val Pro Glu Val Arg Arg lie 
1410 1415 1420 

Gin Ala Ala Trp Ala Glu Ala Glu Ala Asp Ala Ala Arg Ser Gly Ala 
1425 1430 1435 1440 

Gly Gly Asp Ser Gin Leu Leu Arg Ser Leu Arg Gly Arg Pro Glu Glu 
1445 1450 1455 

Ala Gin Leu Ala Glu Leu Leu Arg Leu Val Arg Thr His Ala Ala Ala 
1460 1465 1470 

Val Leu Gly Leu Gly Ser Pro Gly Ala Val Glu Ala Arg Arg Ser Phe 
1475 1480 1485 

Lys Asp Leu Gly Phe Asn Ser Val Thr Ala Val Glu Leu Arg Asn Arg 
1490 1495 1500 

Leu Lys Glu Ala Thr Gly Leu Arg Leu Glu Val Ser Leu Val Phe Asp 
1505 1510 1515 1520 

His Pro Asp Pro Ala Ser Leu Ala Arg His Leu Leu Asp Leu Ala Leu 
1525 1530 1535 

Gly Gin Glu Pro Glu Glu Thr Pro Arg Ala Phe Ala Leu Glu Pro Ala 
1540 1545 1550 

Pro Asn Gly Glu Pro He Ala He Val Ser Met Ala Cys Arg Met Pro 
1555 1560 1565 

Gly Gly Val Ser Thr Pro Glu Glu Leu Trp Arg Leu Leu Arg Asp Gly 
1570 1575 1580 

Lys Asp Ala He Gly Pro Phe Pro Ala Asn Arg Gly Trp Asp Leu Glu 
1585 1590 1595 1600 

Asn Leu Tyr Asp Pro Asp Pro Asp Ala Asp Gly Arg Thr Tyr Val Arg 
1605 1610 1615 

Glu Gly Gly Phe Leu His Glu Ala Pro Asp Phe Asp Pro Ser Phe Phe 
1620 1625 1630 

Gly He Ser Pro Arg Glu Ala Leu Ala Met Asp Pro Gin Gin Arg Leu 
1635 1640 1645 

Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Asp Pro 
1650 1655 1660 

Ala Arg Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly Thr Asn Gly 
1665 1670 1675 1680 

Gin His Tyr Met Pro Leu Leu Gin Asn Gly Gly Asp Ser Phe Asp Gly 
1685 1690 1695 
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Tyr Leu Gly Thr Gly Asn Ser Ala Ser Val Met Ser Gly Arg Leu Ser 
1700 1705 1710 

Tyr Val Phe Gly Leu Glu Gly Pro Ala Val Thr Val Asp Thr Ala Cys 
1715 1720 1725 

Ser Ala Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Met Arg Arg 
1730 1735 1740 

Gly Glu Cys Asp Met Ala Leu Val Gly Gly Ala Thr Val Met Ser Thr 
1745 1750 1755 1760 

Pro Glu Met Leu Val Glu Phe Ser Arg Gin Arg Val He Ser Ala Asn 
1765 1770 1775 

Gly Arg Ser Arg Ala Phe Ala Ala Gly Ala Asp Gly Val Ala Leu Gly 
1780 1785 1790 

Glu Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Glu Arg 
1795 1800 1805 

Asn Gly His Pro Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin 
1810 1815 1820 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ser Gin Gin 
l fi 25 1830 1835 1840 

Arg Val He Arg Gin Ala Leu Ala Asp Ala Gly Leu Arg Pro Glu Asp 
1845 1850 1855 

He Asp Ala Val Glu Ala His Gly Thr Gly Thr Glu Leu Gly Asp Pro 
I860 1865 1870 

He Glu Ala Glu Ala Leu Leu Ala Thr Tyr Gly Arg Thr Arg Thr Ala 
1875 1880 1885 

Asp Arg Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr 
1890 1895 1900 

Gin Ala Ala Ala Gly Val Ala Gly Val He Lys Met Val Leu Ala Leu 
1905 1910 1915 1920 

Gly Asn Glu Thr Leu Pro Arg Thr Leu His Val Asp Glu Pro Thr Pro 
1925 1930 1935 

Arg Val Asp Trp Ser Ser Gly Ala Val Ser Leu Leu Thr Glu Pro Val 
1940 1945 1950 

Asp Trp Pro Ala Gly Pro Ser Ala Pro Arg Arg Ala Ala Val Ser Ser 
1955 i960 1965 

Phe Gly He Ser Gly Thr Asn Ala His Thr He Leu Glu Gin Ala Pro 
1970 1975 1980 

Val Pro Ala Glu Ser Arg Pro Gly Thr Glu Pro Ala Asp Gly Thr Gly 
1985 1990 1995 2000 

Ala Trp Glu Asn Val Thr Val Pro Leu Leu Leu Ser Gly His Thr Glu 
2005 2010 2015 
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Ala Ala Leu Arg Glu Gin Ser Thr Arg Leu Leu Asn Asp Leu Leu Glu 
2020 2025 2030 

His Pro Asp Glu His Pro Ala Asp Val Gly Tyr Thr Leu He Thr Gly 
2035 2040 2045 

Arg Ala His Phe Gly His Arg Ala Ala Val He Gly Glu Ser Arg Glu 
2050 2055 2060 

Glu Leu Leu Asp Ala Leu Lys Ala Leu Ala Glu Gly Arg Glu His His 
2065 2070 2075 2080 

Thr Val Val Arg Gly Asp Gly Thr Ala His Pro Asp Arg Arg Val Val 
2085 2090 2095 

Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Pro Ser Met Ala Arg Asp 
2100 2105 2110 

Leu Leu Asp Arg Ala Pro Ala Phe Arg Glu Thr Ala Lys Ala Cys Asp 
2115 2120 2125 

Ala Ala Leu Ser Val His Leu Asp Trp Ser Val Leu Asp Val Leu Gin 
2130 2135 2140 

Glu Lys Pro Asp Ala Pro Pro Leu Ser Arg Val Asp Val Val Gin Pro 
2145 2150 2155 2160 

Val Leu Phe Thr Met Met Leu Ser Leu Ala Ala Cys Trp Arg Asp Leu 
2165 2170 2175 

Gly Val His Pro Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala 
2180 2185 2190 

Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg He 
2195 2200 2205 

Val Ala Leu Arg Ser Arg Ala Trp Leu Thr Leu Ala Gly Lys Gly Gly 
2210 2215 2220 

Met Ala Ala Val Ser Leu Pro Glu Ala Arg Leu Arg Glu Arg He Glu 
2225 2230 2235 2240 

Arg Phe Gly Gin Arg Leu Ser Val Ala Ala Val Asn Ser Pro Gly Thr 
2245 2250 2255 

Ala Ala Val Ala Gly Asp Val Asp Ala Leu Arg Glu Leu Leu Ala Glu 
2260 2265 2270 

Leu Thr Ala Glu Gly He Arg Ala Lys Pro He Pro Gly Val Asp Thr 
2275 2280 2285 

Ala Gly His Ser Ala Gin Val Asp Gly Leu Lys Glu His Leu Phe Glu 
2290 2295 2300 

Val Leu Ala Pro Val Ser Pro Arg Ser Ser Asp He Pro Phe Tyr Ser 
2305 2310 2315 2320 

Thr Val Thr Gly Ala Pro Leu Asp Thr Glu Arg Leu Asp Ala Gly Tyr 
2325 t 2330 2335 
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Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe Glu Lys Ala Val Arg 
2340 2345 2350 

Ala Leu lie Ala Asp Gly Tyr Asp Leu Phe Leu Glu Cys Asn Pro His 
2355 2360 2365 

Pro Met Leu Ala Met Ser Leu Asp Glu Thr Leu Thr Asp Ser Gly Gly 
2370 2375 2380 

His Gly Thr Val Met His Thr Leu Arg Arg Gin Lys Gly Ser Ala Lys 
2385 2390 2395 2400 

Asp Phe Gly Met Ala Leu Cys Leu Ala Tyr Val Asn Gly Leu Glu lie 
2405 2410 2415 

Asp Gly Glu Ala Leu Phe Gly Pro Asp Ser Arg Arg Val Asn Pro Pro 
2420 2425 2430 

Thr Tyr Pro Phe Gin Arg Glu Arg Tyr Trp Tyr His Pro Thr Ser Gly 
2435 2440 2445 

Arg Arg Gly Asp lie Thr Ala Ala Gly Val Ala Glu Ala Glu His Pro 
2450 2455 2460 

Leu Leu Gly Ala Gly Val Glu Leu Pro Glu Thr Gly Gly Thr Val Tyr 
2465 2470 2475 2480 

Thr Ala Arg Phe Gly Pro Asp Ser Arg Pro Trp Leu Ala Asp His Ala 
2485 2490 2495 

Leu Leu Gly Thr Val Leu Leu Pro Gly Thr Ala lie Leu Asp Leu Val 
2500 2505 2510 

Leu Trp Ala Gly Glu Arg Ser Gly Cys Gly Arg Val Gly Glu Leu Ala 
2515 2520 2525 

Leu Gin Ala Pro Leu Val Leu Pro Asp Ser Gly Asp Val Glu Leu Arg 
2530 2535 2540 

Leu Leu Val Gly Gly Pro Asp Glu Glu Lys Arg Arg Thr Val Thr Val 
2545 2550 2555 2560 

His Ala Arg Pro Ala Ala Ala Gly Ala Glu Ala Pro Trp Thr Arg His 
2565 2570 2575 

Ala Glu Ala Val Val Leu Pro Ala Thr Gly Glu Glu Pro Thr Pro Ala 
2580 2585 2590 

Pro Arg Pro Val Pro Glu Pro Ala Gly Thr Thr Asp Pro Ala Ala Phe 
2595 2600 2605 

Tyr Ala Glu Phe Ala Glu Arg Gly Tyr Asp Tyr Gly Pro Ala Phe Gin 
2610 2615 2620 

Gly Phe Thr Ala Gly Ala Arg His Gly Glu Asp Val Val Ala Glu Val 
2625 2630 2635 2640 

Ala Leu Pro Ser Gly Leu Val Ala Asp Ala Arg His His Arg Leu His 
2645 2650 2655 
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Pro Ala Leu Leu Asp Ala Ala Leu Gin Ala Met He Leu Gly Thr Phe 
2660 2665 2670 

Phe Ala Asp Asp Gly Arg Ala Arg Met Pro Phe Ala Val Arg Gly Val 
2675 2680 2685 

Arg Leu His Thr Ala Gly Ala Asp Arg Leu Arg Val Leu He Ser Pro 
2690 2695 2700 

Ala Gly Asp Glu Thr Val Arg Leu Leu Cys Thr Asp Leu Ala Thr Gly 
2705 2710 2715 2720 

Ala Pro Val Leu Glu He Asp Glu Leu Val Val Arg Pro Val Ser Gly 
2725 2730 2735 

Glu Gin Leu Ala Ala Gly Ala Pro Gly Arg Asn Gly Gly Glu Leu Tyr 
2740 2745 2750 

Arg Val Asp Trp Thr Val Leu Pro Glu Pro Ala Glu Val Pro Ala Pro 
2755 2760 2765 

Arg Trp Ala Leu Leu Gly Glu Asp His Ala Gly Leu Ala Asp Val Leu 
2770 2775 2780 

Gly Gly Thr Gly Gly Gly Cys Glu Arg Tyr Asp Thr Leu Thr Gly Leu 
2785 2790 2795 2800 

Leu Glu Ala Thr Thr Arg Ser Ala Gly Gly He Leu Pro Asp He Val 
2805 2810 2815 

Ala Leu Ser Leu Pro Thr Ala Pro Glu Pro Gly Pro Gin Ala Val Arg 
2820 2825 2830 

Glu Val Leu Ser Gin Ala Leu Asp Ala Ala Gin Ala Trp Leu Ala Ala 
2835 2840 2845 

Gly Ala Glu Thr Ala Ser Ala Arg Leu Val Phe Val Thr Gly Gly Ala 
2850 2855 2860 

Val Ala Thr Thr Ala Asp Glu Thr Val Arg Asp He Ala Ala Ala Ala 
2865 2870 2875 2880 

Val Trp Gly Leu Val Arg Ser Ala Gin Ser Glu Glu Pro Asp Arg Met 
2885 2890 2895 

Val Leu Leu Asp Leu Asp Gly Glu Arg Pro Thr Ala Arg Thr Leu Ala 
2900 2905 2910 

Ala Ala Leu Ala Ser Gly Glu Pro Gin Leu Ala Val Arg Gly Ser Thr 
2915 2920 2925 

Val Ala Ala Pro Arg Leu Ala Pro Ala Gly Pro Gly Pro Glu Asp Leu 
2930 2935 2940 

Val Pro Pro Ala Gly Thr Thr Ala Trp Arg Leu Thr Pro Gly Gly Gly 
2945 2950 2955 2960 

Thr Leu Glu Glu Leu Ser Leu Ala Pro Ala Pro Asp Ala Glu Glu Pro 
2965 2970 2975 
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Leu Ala Pro Gly Gin Val Arg lie Ala Val Arg Ala Ala Gly Val Asn 
2980 2985 2990 

Phe Arg Asp Ala Leu lie Ala Leu Gly Met Tyr Pro Gly Lys Gly Thr 
2995 3000 3005 

Met Gly Ala Glu Gly Ala Gly Val Val Val Glu Thr Ala Pro Asp Val 
3010 3015 3020 

Thr Gly Leu Ser Ala Gly Asp Arg Val Leu Gly Met Trp Asn Gly Gly 
3025 3030 3035 3040 

Phe Gly Pro Leu Val Val Ala Asp His Arg Met Val Ala Pro He Pro 
3045 3050 3055 

His Gly Trp Ser Tyr Ala Glu Ala Ala Ser Val Pro Ala Val Leu Leu 
3060 3065 3070 

Thr Ser Tyr Tyr Ala Leu Thr Arg Leu Ala Arg Ala Arg Thr Gly Gin 
3075 3080 3085 

Thr Val Leu Val His Ala Ala Ala Gly Gly Val Gly Met Ala Thr Leu 
3090 3095 3100 

Gin Leu Ala Arg His Leu Gly Leu Glu Val Tyr Ala Thr Ala Ser Thr 
3105 3110 3115 3120 

Gly Lys Trp Agp Ala Leu Gin Lys His Gly He Pro Asp Asp Arg He 
3125 3130 3135 

Ala Asp Ser Arg Thr Leu Asp Phe Ala Glu Arg Phe Leu Ser Arg Thr 
3140 3145 3150 

Gly Gly Arg Gly Val Asp He Val Leu Asn Ser Leu Ala Gly Glu Phe 
3155 3160 3165 

Val Asp Ala Ser Leu Arg Leu Leu Pro Arg Gly Gly His Phe Leu Glu 
3170 3175 3180 

Leu Gly Lys Ala Asp Val Arg Asp Pro Arg Arg He Ala Ala Ala His 
3185 3190 3195 3200 

Pro Gly Thr Asp Tyr Arg Ala Phe Asp Leu Val Gin Ala Gly Pro Asp 
3205 3210 3215 

Thr Val Gly Glu Met Leu Gly Glu Leu Leu Glu Leu Phe Ala Ala Gly 
3220 3225 3230 

Ala Leu Arg Pro Leu Pro Leu Thr Ala Tyr Gly He Arg Asp Ala Arg 
3235 3240 3245 

Thr Ala Leu Arg Thr Leu Ser Gin Ala Arg His Thr Gly Lys Leu Val 
3250 3255 3260 

Leu Thr Val Pro Ala Gly Phe Asp Thr His Arg Thr Val Leu Leu Thr 
3265 3270 3275 3280 

Gly Gly Thr Gly Thr Leu Gly Gin Thr Leu Ala Arg His Leu Val Asn 
3285 3290 3295 
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Arg His Gly Val Arg His Leu Leu Leu Ala Gly Arg Thr Gly Ala Ala 
3300 3305 3310 

Ala Glu Gly Val Ala Glu Leu He Gly Glu Leu Gly Glu Leu Gly Ala 
3315 3320 3325 

Glu Val Arg Val Ala Ala Cys Asp Ala Ala Asp Arg Gin Arg Leu Thr 
3330 3335 3340 

Glu Leu Leu Ala Gly lie Pro Val Glu His Pro Leu Gly Ala Val Val 
3345 3350 3355 3360 

His Ala Ala Gly Thr Leu Asp Asp Gly Thr He Pro Ser Leu Thr Gly 
3365 3370 3375 

Glu Asn He Asp Asn Val Leu Arg Pro Lys Ala Asp Ala Val Leu Asn 
3380 3385 3390 

Leu His Glu Leu Thr Arg Asp Ala Asp Leu Ser Ala Phe Val Leu Tyr 
3395 3400 3405 

Ser Ser Ser Ser Ala Leu Leu Gly Ser Pro Gly Gin Gly Ala Tyr Ala 
3410 3415 3420 

Ala Ala Asn Ala Phe Leu Asp Gly Phe Ala Arg Tyr Arg Lys Gly Leu 
3425 3430 3435 3440 

Gly Leu Pro Ala Leu Ser Leu Ala Trp Gly Leu Trp Gly Ser Asn Ser 
3445 3450 3455 

Arg Met Ala Gly His Leu Asp Gin Ser Gly Met Gin Arg Arg Leu Asn 
3460 3465 3470 

Arg Ser Gly He Met Ala Leu Thr Asp Ala Glu Gly Leu Ala Leu Phe 
3475 3480 3485 

Asp Ala Ala Gin Asp Gly Gly Asp Ala Leu Leu Val Pro Met Arg Leu 
3490 3495 3500 

Asn Arg Thr Ala Leu Arg Ala Ser Gly Arg He Thr Pro Phe Leu Ser 
3505 3510 3515 3520 

Gly Leu Ala Gly Gly Gly Pro Ala Ala Gly Glu Arg Arg Pro Glu Val 
3525 3530 3535 

Ala Ala Val Ser Gly Thr Leu Ala Glu Arg Leu Thr Gly Leu Thr Ala 
3540 3545 3550 

Gin Glu Gly His Ala Leu Val Leu Ala Glu He Arg Ala His Ala Ala 
3555 3560 3565 

Ala Val Leu Gly His Gly Ser Asp Asp Ser He Pro Glu Asp Arg Ala 
3570 3575 3580 

Phe Lys Asp Leu Gly Phe Asp Ser Leu Thr Ala Val Glu Met Arg Asn 
3585 3590 3595 3600 

Arg Leu Ser Ala Ala Thr Gly Leu Arg Leu Pro Ala Thr Leu Val Phe 
3605 3610 3615 
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Asp His Pro Thr Pro Gly Glu Leu Ala Gly His Leu Ser Ala Glu Leu 
3620 3625 3630 

Ser Ala Asp Asp Ala Pro Gly Ser Ala Ser Pro Leu Thr Glu Leu Asp 
3635 3640 3645 

Arg Phe Glu Ala Leu Phe Thr Ala Leu Ala Pro Gly Thr Thr Lys Asp 
3650 3655 3660 

Thr Pro Gly Gly Ala Gly Ala Leu Met lie Asp Glu Ala Glu Arg Gin 
3665 3670 3675 3680 

Glu lie Ala Gly Arg Leu Ala Ala Leu Ala Gly Leu Trp Asn Arg Leu 
3685 3690 3695 

His Gly Thr Thr Thr Ala Pro Glu Asp Gly Asp Thr Val Ala Asp Ala 
3700 3705 3710 

Leu Glu Ala Ala Asp Asp His Glu He Phe Ala Phe Leu Asp Glu Arg 
3715 3720 3725 



Phe * 

3730 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1612 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Asn Ala Asn Glu Gin Gin Leu Arg Ala Tyr Leu Lys Arg Ala 
15 10 15 

Thr Thr Glu Leu His Arg Thr Ser Glu Gin Leu Arg Glu Glu Arg Ala 
20 25 30 

Arg Ala His Glu Pro He Ala Val Val Gly Met Ala Cys Arg Tyr Pro 
35 40 45 

Gly Gly Ala Asn Thr Pro Glu Gin Phe Trp Glu Leu Leu Asp Thr Gly 
50 55 60 

Thr Asp Ala Ala Ala Pro Met Pro Ser Asp Arg Gly Trp Asp Thr His 
65 70 75 80 

Gly Leu Tyr Asp Pro Asp Pro Ala Ala Ala Gly Arg Thr Tyr Cys Arg 
85 90 95 

Glu Gly Gly Phe Leu His Asp Ala Gly Asp Phe Asp Ala Asp Phe Phe 
100 105 110 

Gly He Ser Pro Arg Glu Ala Val Ala Met Asp Pro Gin Gin Arg Leu 
115 120 125 
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Leu Leu Glu Thr Ser Trp Glu Ala lie Glu Ala Ala Gly He Asp Pro 
130 135 140 

Arg Gly Leu Arg Gly Ser Arg Thr Gly Val Tyr Val Gly Ala Trp Asp 
145 150 155 160 

Ser Gly Tyr Thr Gly Gin Ala His Ala Pro Ser Ala Glu Leu Glu Ala 
165 170 175 

Asp Leu Leu Thr Gly Gly Val Val Ser Phe Thr Ser Gly Arg He Ala 
180 185 190 

Tyr Thr Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys 
195 200 205 

Ser Ser Ser Leu Val Ala Leu His Asn Ala Ala Gin Ala Leu Arg Arg 
210 215 220 

Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr 
225 230 235 240 

Pro Ala Val Phe Val Gin Phe Ala Arg Gin Arg Gly Leu Ala Pro Asp 
245 250 255 

Gly Arg Cys Lys Ala Phe Ala Asp Ala Ala Asp Gly Phe Gly Pro Ala 
260 265 270 

Glu Gly Val Gly Met Val Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
275 280 285 

Leu Gly His Pro Val Leu Ala Val Val Cys Gly Ser Ala Val Asn Gin 
290 295 300 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ser Gin Glu 
305 310 315 320 

Arg Val He Arg Gin Ala Leu Gly Asn Ala Arg Leu Thr Val Ala Asp 
325 330 335 

Val Asp Val Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro 
340 345 350 

He Glu Ala Gin Ala Leu Leu Gly Thr Tyr Gly Arg Asp Arg Asp Gly 
355 360 365 

Gly Arg Pro Val Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala 
370 375 380 

Gin Ala Ala Ala Gly Val Ala Gly Val lie Lys Met Val Leu Ala Met 
385 390 395 400 

Arg Tyr Gly Trp Leu Pro Arg Thr Leu His Val Asp Glu Pro Ser Arg 
405 410 415 

His Val Asp Trp Ser Ala Gly Gly Val Arg Leu Leu Thr Glu Ala Arg 
420 425 430 



Glu Trp Pro Gly Val Asp Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 
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Gly Val Ser Gly Thr Asn Ala His Leu lie Leu Glu Ala Pro Asp Thr 
450 455 460 

Ala Glu Ala Glu Ser Ala Thr Thr Pro Val Arg Ser Glu Val Ser Glu 
465 470 475 480 



10 



Ser Ala Ala Val Leu Asp Ala Arg Ser Gly Val Val Pro Val Val Val 
485 490 495 

Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly Arg Leu Ala 
500 505 510 



75 



Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val Ala Val Thr 
515 520 525 

Met Ala Gly Arg Ser Arg Phe Gly Tyr Arg Ala Val Val Leu Ala Arg 
530 535 540 



20 



Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala Gly Gly Asp 
545 550 555 560 



25 



30 



Pro Asp Ala Gly Val Val Thr Gly Ala Val Val Asp Pro Glu Thr Gly 
565 570 575 

Ser Gly Gly Gly Gly Val Val Leu Val Phe Pro Gly Gin Gly Thr Gin 
580 585 590 

Trp Val Gly Met Gly Ala Gly Leu Leu Gly Ser Ser Glu Val Phe Ala 
595 600 605 

Ala Ser Met Arg Glu Cys Ala Arg Ala Leu Ser Val His Val Gly Trp 
610 615 620 



35 



40 



Asp Leu Leu Glu Val Val Ser Gly Gly Ala Gly Leu Glu Arg Val Asp 
625 630 635 640 

Val Val Gin Pro Val Thr Trp Ala Val Met Val Ser Leu Ala Arg Tyr 
645 650 655 

Trp Gin Ala Met Gly Val Asp Val Ala Ala Val Val Gly His Ser Gin 
660 665 670 

Gly Glu He Ala Ala Ala Thr Val Ala Gly Ala Leu Ser Leu Glu Asp 
675 680 685 



45 



50 



Ala Ala Ala Val Val Ala Leu Arg Ala Gly Leu He Gly Arg Tyr Leu 
690 695 700 

Ala Gly Arg Gly Ala Met Ala Ala Val Pro Leu Pro Ala Gly Glu Val 
705 710 715 720 

Glu Ala Gly Leu Ala Lys Trp Pro Gly Val Glu Val Ala Ala Val Asn 
725 730 735 

Gly Pro Ala Ser Thr Val Val Ser Gly Asp Arg Arg Ala Val Ala Gly 
740 745 750 



55 



Tyr Val Ala Val Cys Gin Ala Glu Gly Val Gin Ala Arg Leu He Pro 
755 760 765 
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Val Asp Tyr Ala Ser His Ser Arg His Val Glu Asp Leu Lys Gly Glu 
770 775 780 

Leu Glu Arg Val Leu Ser Gly lie Arg Pro Arg Ser Pro Arg Val Pro 
785 790 795 800 

Val Cys Ser Thr Val Ala Gly Glu Gin Pro Gly Glu Pro Val Phe Asp 
805 810 815 

Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn Arg Val Glu Phe Ser Ala 
820 825 830 

Val Val Gly Gly Leu Leu Glu Glu Gly His Arg Arg Phe lie Glu Val 
835 840 845 

Ser Ala His Pro Val Leu Val His Ala He Glu Gin Thr Ala Glu Ala 
850 855 860 

Ala Asp Arg Ser Val His Ala Thr Gly Thr Leu Arg Arg Gin Asp Asp 
865 870 875 880 

Ser Pro His Arg Leu Leu Thr Ser Thr Ala Glu Ala Trp Ala His Gly 
885 890 895 

Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro Pro Gly His Leu Thr Thr 
900 905 910 

Leu Pro Thr Tyr Pro Phe Asn His His His Tyr Trp Leu Asp Thr Thr 
915 920 925 

Pro Thr Thr Pro Ala Thr Thr Thr Gin Ser Pro Thr Asp Ala Trp Arg 
930 935 940 

Tyr Arg Val Thr Trp Lys Ala Leu Thr Glu Glu Ser Thr Pro Ala Ser 
945 950 955 960 

Ser Pro Ser Gly His Trp Leu Leu Val Thr Pro Pro Thr Pro Glu Gly 
965 970 975 

Arg Thr Leu Gly Asp Arg Ala Ala Gly Ala Leu Ala Arg Gin Gly Ala 
980 985 990 

Thr Val Glu Arg Leu Val Val Asp Pro Val Ala Val Gly Arg Asp Gly 
995 1000 1005 

Leu Ala Ala Arg Leu Gly Glu Arg Trp Asp Gly Val Leu Ser Leu Leu 
1010 1015 1020 

Gly Ala Asp Glu Arg Pro Leu Pro Arg His Pro Ala Leu Asn Arg Ala 
1025 1030 1035 1040 

Val Met Gly Thr Thr Leu Leu Ala Gin Ala Ala Leu Asp Ala Gly Cys 
1045 1050 1055 

Glu Ala Arg He Trp Ala Val Thr Arg Glu Ala Val Ala Val Ser Pro 
1060 1065 1070 

Ser Glu Val Pro Arg Asp Ala Gly Ala Gin Leu Trp Gly Leu Gly Arg 
1075 1080 1085 
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Gly lie Ala Leu Glu His Pro Ser Leu Trp Gly Gly Leu lie Asp Leu 
1090 1095 1100 

Pro Ala Val Pro Asp Glu Arg Ala Trp Ala Arg Ala Val Arg Arg Leu 
1105 1110 1115 1120 

Val Pro His Gly Glu Asp Gin He Ala Ala Arg Ala Ser Gly Ala Tyr 
1125 1130 1135 

Gly Arg Arg Leu Leu Pro Ala Pro Pro Ala Ala Ser Arg Arg Thr Cys 
1140 1145 1150 

Thr Pro Ser Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
1155 1160 1165 

Gly His Leu Ala Arg Arg Leu Ala Arg Gly Gly Thr Gly His Leu Val 
1170 1175 1180 

Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Gly Glu Leu Ala 
1185 1190 1195 1200 

Gly Glu Leu Ala Ser Leu Gly Ala Lys Val Thr Val Ala Ala Cys Asp 
1205 1210 1215 

Met Ala Asp Arg Glu Ala Val Arg Ala Leu Leu Asp Glu His Arg Pro 
1220 1225 1230 

Thr Ala Val Phe His Thr Ala Gly Thr Pro His Ser Ala Glu Phe Thr 
1235 1240 1245 

Ala Leu Asp Glu Thr Thr Thr Ala Gly Val Tyr Gly Gly Lys Val Leu 
1250 1255 1260 

Gly Ala Arg His Leu Asp Glu Leu Thr Arg Glu Leu Gly He Gly Leu 
1265 1270 1275 1280 

Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Ala Val Trp Gly Ser Gly 
1285 1290 1295 

Gly Gin Thr Ala Tyr Gly Ala Ala Asn Ala Ala Leu Asp Ala Leu Ala 
1300 1305 1310 

Glu Arg Arg Arg Ala Ala Gly Leu Pro Ala Thr Ser Val Ala Trp Gly 
1315 1320 1325 

Leu Trp Gly Gly Gly Gly Met Gly Glu Gly Asp Gly Glu Glu Phe Leu 
1330 1335 1340 

Ser Arg Arg Gly Leu Gly Val Met Pro Pro Glu Asp Ala Leu Glu Ala 
1345 1350 1355 1360 

Leu Asp Arg Ala Leu Asp Arg Glu Asp Thr Thr Val Val Val Ala Asp 
1365 1370 1375 

Val Asp Trp Glu Arg Phe Ala Pro Ala Phe Thr Ala Phe Arg Pro Ser 
1380 1385 1390 

Ala Leu He Ser Arg Leu Val Ser Asp Gly Gly Glu Ala Gly Gly Gin 
1395 1400 1405 
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Asp Ala Pro Asp Gly Thr Leu Phe Ala Ala Gly Phe Ala Ala Ala Gly 
1410 1415 1420 

Pro Leu Glu Arg Gin Glu Met Leu Leu Gly Leu Val Arg Arg His Val 
1425 1430 1435 1440 

Ala Ala Val Leu Gly His Pro Gly Thr Ala Asp lie Gly Pro Asp Arg 
1445 1450 1455 

Ala Phe Lys Glu Leu Gly Phe Ser Ser Val Thr Ala Val Glu Leu Ala 
1460 1465 1470 

Gly Arg Leu Gly Arg Glu Cys Gly Arg Lys Leu Pro Pro Thr Leu Val 
1475 1480 1485 

Phe Asp His Pro Thr Ala Ala Ala Ala Val Glu His Leu Ala Glu Leu 
1490 1495 1500 

Leu Thr Pro Pro Ala Gly Pro Ala Ala Gly Pro Arg Glu Glu Glu Ala 
1505 1510 1515 1520 

Arg Ala Ala Leu Ala Arg Val Pro Leu Glu Arg Leu Arg Glu Ala Gly 
1525 1530 1535 

Leu Leu Asp Ala Leu Leu Arg Leu Ala Ala Asp Glu Ser Gly Ala Thr 
1540 1545 1550 

Thr Pro Arg Thr Ser Ala Ala Ser Gly Ala Pro Arg Gly Arg Glu Glu 
1555 1560 1565 

Pro Asp Gly Arg Gly Glu Pro Asp Gly Ser Gly His Arg Glu Ser Pro 
1570 1575 1580 

Asp Ala Ala Gly Gly Ser Asp Ala Leu Asp Asp Leu Asp Gly Asp Ala 
1585 1590 1595 1600 

Leu Val Arg Leu Ala Leu Gly Glu Pro Gly Glu * 
1605 1610 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Met Ser Ala Glu Arg Leu Thr Glu Ala Leu Arg Thr Ser Leu 
15 10 15 

Lys Glu Ala Glu Arg Leu Arg Arg Gin Asn Arg Glu Leu Arg Ala Ala 
20 25 30 

Arg Asp Ala Ala Arg Glu Pro lie Ala Val Val Gly Met Ala Cys Arg 
35 40 45 
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Tyr Pro Gly Gly Val Thr Gly Pro Glu Glu Leu Trp Glu Leu Val Ala 
50 .55 60 

Gly Gly Arg Asp Ala He Gly Pro Phe Pro Val Asp Arg Gly Trp Asp 
65 70 75 80 

Val Ala Ser Val Tyr Asp Pro Asp Pro Glu Ser Lys Gly Thr Thr Tyr 
85 90 95 

Cys Arg Glu Gly Gly Phe Leu Glu Gly Ala Gly Asp Phe Asp Ala Ala 
100 105 no 

Phe Phe Gly He Ser Pro Arg Glu Ala Leu Val Met Asp Pro Gin Gin 
115 120 125 

Arg Leu Leu Leu Glu Val Ser Trp Glu Ala Leu Glu Arg Ala Gly He 
130 135 140 

Asp Pro Ser Ser Leu Arg Gly Ser Arg Gly Gly Val Tyr Val Gly Ala 
145 150 155 160 

Ala His Gly Ser Tyr Ala Ser Asp Pro Arg Leu Val Pro Glu Gly Ser 
165 170 175 

Glu Gly Tyr Leu Leu Thr Gly Ser Ala Asp Ala Val Met Ser Gly Arg 
180 185 190 

He Ser Tyr Ala Leu Gly Leu Glu Gly Pro Ser Met Thr Val Glu Thr 
195 200 205 

Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Arg Ala Leu 
210 215 220 



Arg His Gly Glu Cys Gly Leu Ala Leu Ala Gly Gly Val Ala Val Met 
225 230 235 240 

Ala Asp Pro Ala Ala Phe Val Glu Phe Ser Arg Gin Lys Gly Leu Ala 
245 250 255 

Ala Asp Gly Arg Cys Lys Ala Phe Ser Ala Ala Ala Asp Gly Thr Gly 
260 265 270 

Trp Ala Glu Gly Val Gly Val Leu Val Leu Glu Arg Leu Ser Asp Ala 
275 280 285 

Arg Arg Ala Gly His Thr Val Leu Gly Leu Val Thr Gly Thr Ala Val 
290 295 300 

Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Gly Pro Ala 
305 310 315 320 

Gin Gin Arg Val He Ala Glu Ala Leu Ala Asp Ala Gly Leu Ser Pro 
325 330 335 

Glu Asp Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly 
340 345 350 

Asp Pro He Glu Ala Gly Ala Leu Leu Ala Ala Ser Gly Arg Asn Arg 
355 360 365 
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Ser Gly Asp His Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly 
370 375 380 

His Ala Gin Ala Ala Ala Gly Val Gly Gly Val He Lys Met Leu Gin 
385 390 395 400 

Ala Leu Arg His Gly Leu Leu Pro Arg Thr Leu His Ala Asp Glu Pro 
405 410 415 

Thr Pro His Ala Asp Trp Ser Ser Gly Arg Val Arg Leu Leu Thr Ser 
420 425 430 



15 



Glu Val Pro Trp Gin Arg Thr Gly Arg Pro Arg Arg Thr Gly Val Ser 
435 440 445 



Ala Phe Gly Val Gly Gly Thr Asn Ala His Val Val Leu Glu Glu Ala 
450 455 460 



20 



Pro Ala Pro Pro Ala Pro Glu Pro Ala Gly Glu Ala Pro Gly Gly Ser 
465 470 475 480 



Arg Ala Ala Glu Gly Ala Glu Gly Pro Leu Ala Trp Val Val Ser Gly 
485 490 495 



25 



Arg Asp Glu Pro Ala Leu Arg Ser Gin Ala Arg Arg Leu Arg Asp His 
500 505 510 



Leu Ser Arg Thr Pro Gly Ala Arg Pro Arg Asp He Ala Phe Ser Leu 
515 520 525 



30 



Ala Ala Thr Arg Ala Ala Phe Asp His Arg Ala Val Leu He Gly Ser 
530 535 540 



35 



Asp Gly Ala Glu Leu Ala Ala Ala Leu Asp Ala Leu Ala Glu Gly Arg 
545 550 555 560 

Asp Gly Pro Ala Val Val Arg Gly Val Arg Asp Arg Asp Gly Arg Met 
565 570 575 



Ala Phe Leu Phe Thr Gly Gin Gly Ser Gin Arg Ala Gly Met Ala His 
580 585 590 

Asp Leu His Ala Ala His Thr Phe Phe Ala Ser Ala Leu Asp Glu Val 
595 600 605 



45 



Ttix Asp Arg Leu Asp Pro Leu Leu Gly Arg Pro Leu Gly Ala Leu Leu 
610 615 620 



Asp Ala Arg Pro Gly Ser Pro Glu Ala Ala Leu Leu Asp Arg Thr Glu 
625 630 635 640 



50 



Tyr Thr Gin Pro Ala Leu Phe Ala Val Glu Val Ala Leu His Arg Leu 
645 650 655 



Leu Glu His Trp Gly Met Arg Pro Asp Leu Leu Leu Gly His Ser Val 
660 665 670 



55 



Gly Glu Leu Ala Ala Ala His Val Ala Gly Val Leu Asp Leu Asp Asp 
675 680 685 
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Ala Cys Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gin Arg Leu Pro 
690 695 700 

Pro Gly Gly Ala Met Val Ser Val Arg Ala Gly Glu Asp Glu Val Arg 
705 710 715 720 

Ala Leu Leu Ala Gly Arg Glu Asp Ala Val Cys Val Ala Ala Val Asn 
725 730 735 

Gly Pro Arg Ser Val Val He Ser Gly Ala Glu Glu Ala Val Ala Glu 
740 745 750 

Ala Ala Ala Gin Leu Ala Gly Arg Gly Arg Arg Thr Arg Arg Leu Arg 
755 760 765 

Val Ala His Ala Phe His Ser Pro Leu Met Asp Gly Met Leu Ala Gly 
770 775 780 

Phe Arg Glu Val Ala Ala Gly Leu Arg Tyr Arg Glu Pro Glu Leu Thr 
785 790 795 800 

Val Val Ser Thr Val Thr Gly Arg Pro Ala Arg Pro Gly Glu Leu Thr 
805 810 815 

Gly Pro Asp Tyr Trp Val Ala Gin Val Arg Glu Pro Val Arg Phe Ala 
820 825 830 

Asp Ala Val Arg Thr Ala His Arg Leu Gly Ala Arg Thr Phe Leu Glu 
835 840 845 

Thr Gly Pro Asp Gly Val Leu Cys Gly Met Ala Glu Glu Cys Leu Glu 
850 855 860 

Asp Asp Thr Val Ala Leu Leu Pro Ala He His Lys Pro Gly Thr Ala 
865 870 875 880 

Pro His Gly Pro Ala Ala Pro Gly Ala Leu Arg Ala Ala Ala Ala Ala 
885 890 895 

Tyr Gly Arg Gly Ala Arg Val Asp Trp Ala Gly Met His Ala Asp Gly 
900 905 910 

Pro Glu Gly Pro Ala Arg Arg Val Glu Leu Pro Val His Ala Phe Arg 
915 920 925 

His Arg Arg Tyr Trp Leu Ala Pro Gly Arg Ala Ala Asp Thr Asp Asp 
930 935 940 



Trp Met Tyr Arg He Gly Trp Asp Arg Leu Pro Ala Val Thr Gly Gly 
945 950 955 960 

Ala Arg Thr Ala Gly Arg Trp Leu Val He His Pro Asp Ser Pro Arg 
965 970 975 

Cys Arg Glu Leu Ser Gly His Ala Glu Arg Ala Leu Arg Ala Ala Gly 
980 985 990 

Ala Ser Pro Val Pro Leu Pro Val Asp Ala Pro Ala Ala Asp Arg Ala 
995 1000 1005 
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Ser Phe Ala Ala Leu Leu Arg Ser Ala Thr Gly Pro Asp Thr Arg Gly 
1010 1015 1020 

Asp Thr Ala Ala Pro Val Ala Gly Val Leu Ser Leu Leu Ser Glu Glu 
1025 1030 1035 1040 

Asp Arg Pro His Arg Gin His Ala Pro Val Pro Ala Gly Val Leu Ala 
1045 1050 1055 

Thr Leu Ser Leu Met Gin Ala Met Glu Glu Glu Ala Val Glu Ala Arg 
1060 1065 1070 

Val Trp Cys Val Ser Arg Ala Ala Val Ala Ala Ala Asp Arg Glu Arg 
1075 1080 1085 

Pro Val Gly Ala Gly Ala Ala Leu Trp Gly Leu Gly Arg Val Ala Ala 
1090 1095 1100 

Leu Glu Arg Pro Thr Arg Trp Gly Gly Leu Val Asp Leu Pro Ala Ser 
1105 1110 1115 1120 

Pro Gly Ala Ala His Trp Ala Ala Ala Val Glu Arg Leu Ala Gly Pro 
1125 1130 1135 

Glu Asp Gin lie Ala Val Arg Ala Ser Gly Ser Trp Gly Arg Arg Leu 
1140 1145 1150 

Thr Arg Leu Pro Arg Asp Gly Gly Gly Arg Thr Ala Ala Pro Ala Tyr 
1155 1160 1165 

Arg Pro Arg Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
1170 1175 1180 

Gly His Leu Ala Arg Trp Leu Ala Ala Ala Gly. Ala Glu His Leu Ala 
1185 1190 1195 1200 

Leu Thr Ser Arg Arg Gly Pro Asp Ala Pro Gly Ala Ala Gly Leu Glu 
1205 1210 1215 

Ala Glu Leu Leu Leu Leu Gly Ala Lys Val Thr Phe Ala Ala Cys Asp 
1220 1225 1230 

Thr Ala Asp Arg Asp Gly Leu Ala Arg Val Leu Arg Ala lie Pro Glu 
1235 1240 1245 

Asp Thr Pro Leu Thr Ala Val Phe His Ala Ala Gly Val Pro Gin Val 
1250 1255 1260 

Thr Pro Leu Ser Arg Thr Ser Pro Glu His Phe Ala Asp Val Tyr Ala 
1265 1270 1275 1280 

Gly Lys Ala Ala Gly Ala Ala His Leu Asp Glu Leu Thr Arg Glu Leu 
1285 1290 1295 

Gly Ala Gly Leu Asp Ala Phe Val Leu Tyr Ser Ser Gly Ala Gly Val 
1300 1305 1310 

Trp Gly Ser Ala Gly Gin Gly Ala Tyr Ala Ala Ala Asn Ala Ala Leu 
1315 1320 1325 
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Asp Ala Leu Ala Arg Arg Arg Ala Ala Asp Gly Leu Pro Ala Thr Ser 
1330 1335 1340 

lie Ala Trp Gly Val Trp Gly Gly Gly Gly Met Gly Ala Asp Glu Ala 
1345 1350 1355 1360 

Gly Ala Glu Tyr Leu Gly Arg Arg Gly Met Arg Pro Met Ala Pro Val 
1365 1370 1375 

Ser Ala Leu Arg Ala Met Ala Thr Ala He Ala Ser Gly Glu Pro Cys 
1380 1385 1390 

Pro Thr Val Thr His Thr Asp Trp Glu Arg Phe Gly Glu Gly Phe Thr 
1395 1400 1405 

Ala Phe Arg Pro Ser Pro Leu He Ala Gly Leu Gly Thr Pro Gly Gly 
1410 1415 1420 

Gly Arg Ala Ala Glu Thr Pro Glu Glu Gly Asn Ala Thr Ala Ala Ala 
1425 1430 1435 1440 

Asp Leu Thr Ala Leu Pro Pro Ala Glu Leu Arg Thr Ala Leu Arg Glu 
1445 1450 1455 

Leu Val Arg Ala Arg Thr Ala Ala Ala Leu Gly Leu Asp Asp Pro Ala 
1460 1465 1470 

Glu Val Ala Glu Gly Glu Arg Phe Pro Ala Met Gly Phe Asp Ser Leu 
1475 1480 1485 

Ala Thr Val Arg Leu Arg Arg Gly Leu Ala Ser Ala Thr Gly Leu Asp 
1490 1495 1500 

Leu Pro Pro Asp Leu Leu Phe Asp Arg Asp Thr Pro Ala Ala Leu Ala 
1505 1510 1515 1520 

Ala His Leu Ala Glu Leu Leu Ala Thr Ala Arg Asp His Gly Pro Gly 
1525 1530 1535 

Gly Pro Gly Thr Gly Ala Ala Pro Ala Asp Ala Gly Ser Gly Leu Pro 
1540 1545 1550 

Ala Leu Tyr Arg Glu Ala Val Arg Thr Gly Arg Ala Ala Glu Met Ala 
1555 1560 1565 

Glu Leu Leu Ala Ala Ala Ser Arg Phe Arg Pro Ala Phe Gly Thr Ala 
1570 1575 1580 

Asp Arg Gin Pro Val Ala Leu Val Pro Leu Ala Asp Gly Ala Glu Asp 
1585 1590 1595 1600 

Thr Gly Leu Pro Leu Leu Val Gly Cys Ala Gly Thr Ala Val Ala Ser 
1605 1610 1615 

Gly Pro Val Glu Phe Thr Ala Phe Ala Gly Ala Leu Ala Asp Leu Pro 
1620 1625 1630 

Ala Ala Ala Pro Met Ala Ala Leu Pro Gin Pro Gly Phe Leu Pro Gly 
1635 1640 1645 
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Glu Arg Val Pro Ala Thr Pro Glu Ala Leu Phe Glu Ala Gin Ala Glu 
1650 1655 1660 

Ala Leu Leu Arg Tyr Ala Ala Gly Arg Pro Phe Val Leu Leu Gly His 
1665 1670 1675 1680 

Ser Ala Gly Ala Asn Met Ala His Ala Leu Thr Arg His Leu Glu Ala 
1685 1690 1695 

Asn Gly Gly Gly Pro Ala Gly Leu Val Leu Met Asp lie Tyr Thr Pro 
1700 1705 1710 

Ala Asp Pro Gly Ala Met Gly Val Trp Arg Asn Asp Met Phe Gin Trp 
1715 1720 1725 

Val Trp Arg Arg Ser Asp lie Pro Pro Asp Asp His Arg Leu Thr Ala 
1730 1735 1740 

Met Gly Ala Tyr His Arg Leu Leu Leu Asp Trp Ser Pro Thr Pro Val 
1745 1750 1755 1760 

Arg Ala Pro Val Leu His Leu Arg Ala Ala Glu Pro Met Gly Asp Trp 
1765 1770 1775 

Pro Pro Gly Asp Thr Gly Trp Gin Ser His Trp Asp Gly Ala His Thr 
1780 1785 1790 

Thr Ala Gly He Pro Gly Asn His Phe Thr Met Met Thr Glu His Ala 
1795 1800 1805 

Ser Ala Ala Ala Arg Leu Val His Gly Trp Leu Ala Glu Arg Thr Pro 
1810 1815 1820 

Ser Gly Gin Gly Gly Ser Pro Ser Arg Ala Ala Gly Arg Glu Glu Arg 
1825 1830 1835 1840 

Pro * 



(2) INFORMATION FOR SEQ ID NO:7: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44377 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 350.. 14002 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 14046.. 20036 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 20110.. 3:284 

( ix ) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 31329.. 36071 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 36155.. 41830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GACCGCTCGG GGAGACCTGA CATATTCGTC GCGAAGTGGT TGTCCGCGCC GCGAGGTACT 
GAAATCTTCT CCGCTCGCCC AGGACTCCGC GTGCAGGTCA CCGGAGTGCG CGACCGGCCG 
GGACGTCGGA GCGCCGACCC TGCGGACCTC GTCCGATGCC GTCTGGTCCC GCATGATCCC 
GCGCCGTCTC CGGTGACGAG AATCGGTGGA CAATCTCCGA ACTTGACACA ATTGATTGTC 
GTTCACCGGC CGTTCCTGTC GCCCGGCAGT TCGCCCGCTG TACGCTCGGG AAGATCAAGA 
AAAGGCAGAA AAGCCACGGC GTGGTACGGC GAACATATGA GGGATGCAGG TGTCTGGAGA 
ACTCGCGATT TCCCGCAGTG ACGACCGGTC CGACGCCGTT GCCGTGGTCG GAATGGCGTG 
CCGGTTTCCC GGCGCCCCGG GAATTGCCGA ATTCTGGAAA CTGCTGACCG ACGGAAGGGA 
CGCGATCGGC CGGGACGCCG ACGGCCGCCG GCGCGGCATG ATCGAGGCGC CCGGCGACTT 
CGACGCCGCC TTCTTCGGCA TGTCACCCCG CGAGGCCGCC GAGACCGACC CCCAGCAGCG 
CCTGATGCTC GAACTCGGCT GGGAGCCTCT GGAGGACGCC GGCATCGTCC CCGGCTCCCT 
GCGCGGCGAG GCGGTCGGCG TCTTCGTCGG GGCCATCCAC GACGACTACG CCACCCTGCT 
CCACCGCGCC GGCGCCCCGG TCGGCCCCCA CACCGCCACC GGCCTCCAGC GCGCCATGCT 
CGCCAACCGG CTCTCCTACG TCCTGGGGAC GCGCGGCCCC AGCCTCGCGG TCGACACCGC 
CCAGTCGTCC TCCCTGGTCG CCGTGGCCCT CGCCGTCGAG AGCCTGCGGG CCGGCACCTC 
CCGCGTCGCC GTCGCCGGGG GCGTCAACCT GGTCCTCGCC GACGAGGGAA CGGCCGCCAT 
GGAACGCCTC GGCGCGCTGT CACCCGACGG CCGCPGCCAC ACCTTCGACG CCCGTGCCAA 
CGGCTATGTC CGCGGTGAGG GCGGCGCCGC CGTCGTCCTG AAGCCCCTCG CCGACGCCCT 
GGCCGACGGG GACCCCGTGT ACTGCGTGGT GCGTGGCGTC GCCGTCGGCA ACGACGGCGG 
CGGCCCCGGG CTGACCGCTC CCGACCGCGA OGGACAGGAG GCGGTGCTCC GGGCCGCCTG 
CGCCCAGGCC CGGGTCGACC CCGCCGAGGT GCGTTTCGTC GAACTGCACG GCACGGGAAC 
CCCGGTGGGC GACCCGGTCG AGGCACACCC CCTCGGCGCG GTGCACGGCT CCGCTCGGCC 
GGCCGACGAC CCCCTGCTGG TGGGGTCGGT GAAGACCAAC ATCGGCCACC TGGAGGGCGC 
CGCCGGCATC GCGGGCCTGG TCAAGGCCGC ACTCTGCCTG CGGGAACGCA CCCTTCCCGG 
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CTCGCTGAAC 
GCAGACCGCT 

5 

CAGTTCGTTC 
GCCCACCCCG 

10 CGCGCGGTCG 
GGTGGGCGCG 
GCACCGTGCG 

75 TGCTGCCGGG 
GGTGTTGTTC 
GGGTGGGGTC 

20 

TCGGTCGTTG 
TGCGGGTGCC 
GGGTCGGACG 

25 

GTTGGAGGCT 
TGCTGCGTAT 

30 GGGTGGGTTG 
GTCGGTGGTG 
TGGGCCGCGG 

55 GCTGATGGGG 
GTTGATGGAG 
GGTGCGGCCG 

40 

GTTGGGGGAT 
GGTGGGGGTG 
A - GCTCACGGGG 

45 

GATGCGGCGG 
CCGGGACGCC 
50 CCTCCCCACC 
CACGGCCGAC 
GGACCCGGAG 

55 

GTCACCGGAG 



TTCGCCACCC CCTCTCCGGC 
GCCGCCGAGC TCCCGCTCGC 
GGCATCGGTG GCACCAACTG 
GCCGTCTCCG TCGCCGCCTC 
GAGGGGGCGT TGCGGGCGCA 
GATCCGCGGG ATGTGGCTTA 
GTGGTGCCGT GTGGTGGGCG 
AGGGTGTCTG GGGGTGTGCG 
ACGGGTCAGG GTGCGCAGTG 
TTTGCGGAGG TGCTGGATGA 
CGGGATGTGA TGTTCGGCGA 
GGTGCGGGTG CTGGGGTCGG 
GAGTTTGCTC AGCCTGCGTT 
CGGGGTCTGG AGGTGTCGGT 
GTGGCGGGGG TGTTGTCGTT 
ATGGGTGGGT TGCCGGTGGG 
CGGGGGGTTG TTGAGGGGTT 
TCGGTGGTGT TGTCGGGTGA 
GATGGGGTGG AGTGCCGGCG 
CCGGTGTTGG GGGAGTTCCG 
GGTGTGGTGG TCGTGTCGGG 
CCGGGGTATT GGGTGCGTCA 
GTGCGTGGTC TGGGTGTGGG 
ATGGCGGGTG AGTGCCTGGG 
GGCCGTGCGG AGCGGGAGGT 
GGCCTGGACG CCACGGCACT 
TACCCCTTCC AACGCCGTAC 
CCCGGGGCGG GTGTGACCGC 
AGCACCGAGG GGACGTCCCA 
CCGATGTCCC CCGAGGATGC 
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CATCCCGCTG GACCAGCTCC 
CCCGGGCGGC GCACCCCTGC 
CCATCTGGTC CTGGAACACC 
GCTTCCGGAC GTCCCGCCGC 
GGCGGTGCGG TTGGGTGAGT 
TTCGCTGGCT TCGACGCGGA 
TGGGGAGCTC GTCGCTGCTC 
GTCCGGGCGG GCTGTGCCGG 
GGTTGGTATG GGGCGTGGGT 
GGTCTTGTCG ATGGTGGGGG 
CGTCGACGTG GACGCGGGTG 
TTCTGGTTCC GGTTCTGTGG 
GTTTGCGTTG GAGGTGGCGT 
GGTGTTGGGT CATTCGGTCG 
GGGTGATGCG GTGCGGTTGG 
TGGGGGGATG TGGTCGGTGG 
GGGGGAGTGG GTGTCGGTTG 
TGTGGGTGTG CTGGAGTCGG 
GTTGGATGTG TCGCATGGGT 
GGGGGTTGTG GAGTCGTTGG 
TGTGTCGGGT GGGGTGGTGG 
TGCGCGGGAG GCGGTGCGTT 
GACGTTGGTG GAGGTGGGTC 
GGCCGGTGAT GATGTGGTGG 
GTTCGAGGCG GCGCTGGCGA 
CCACACCGGG AGCACCGGCC 
CCACTGGTCG CCCGCGCTGA 
CACCGATGCC GTGGGGCACA 
CAGGGACACG GACGACGAGG 
CGTCCGCCTG GTCCGCGAGA 



GGCTGAAGGT 1500 

TCGCGGGTGT 1560 

TGCCCTCCCG 1620 

TGTTGTTGTC 1680 

ACGTGGAGCG 1740 

CTCTTTTCGA 1800 

TTGGTGGGTT 1860 

GTGGGGTGGG 1920 

TGTATGCGGG 1980 

AGGTGGATGG 2040 

CCGGGGCTGA 2100 

GTGGGTTGTT 2160 

TGTTCCGGGC 2220 

GGGAGGTGGC 2280 

TGGTGGCGCG 2340 

GGGCGTCGGA 2400 

CGGCGGTGAA 2460 

TGGTTGCCTC 2520 

TTCATTCGGT 2580 

AGTTCGGTCG 2640 

GTTCGGGGGA 2700 

TCGCGGATGG 2760 

CGCATGGGGT 2820 

TOGTGCCGGC 2880 

CGGTGTTCAC 2940 

GGCGCATCGA 3000 

GCCGGCCGGT 3060 

GCGTCTCCCC 3120 

CGGACTCGGC 3180 

GCACCGCGGC 3240 
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CGTCCTGGGC 


WtW\>AwW\ 1 v. 


C LaAjCGAGGT CGCGCTCGAC CGCACC7TTCA CCTCCCAGGG 


3300 


5 




V* 4V*AV»Vviv»Uv» 


TCGAQjCTGTG CGACC ivCTTG AAGGGCGCCT CGGGGCTCCC 


3360 




CCTCGCCGCC 


ACGCTGGTCT 


ACGACCTGCC CACCCCGCGT GCCGTCGCCG AGCACATCGT 


3420 




GGAAGC C t»CG 


GGCvAKjCC. CiA 


AGGACTCGGT TGCCGGTGGG CCCGGAGTGC TCTCGTCGGC 


3480 


10 


I AGGG 


/'TVTWV* A 

Glo IvJUUAUU 


CCCGGGGCGG CAGCCGGGAC GACGACGACC CGATCGCCAT 


3540 




CGTGGGTGTC 


GGCTGCCGGC 


TCCCCGGCGG CGTCGACTCG CGCGCCGCTC TCTGGGAGCT 


3600 




GC 1\jCjAGTCC 


GGCGCCGACG 


CCATCTCGTC CTTCCCCACC GACCGCGGCT GGGACCTCGA 


3660 


IS 


cliUOU I\j 1 At- 


GAt-CCLUAGC 


CCGGGACGCC CGGCAAGACC TATGTGCGGG AGGGCGGGTT 


3720 






GC\»GC.(»UAG I 


TCGACGCGGA GTTCTTCGGG ATATCGCCGC GCGAGGCCAC 


3780 


20 


GOCCATGGAC 


CCGCAGCAGC 


GCTTGCTGCT GGAAGCGTCG TGGGAGGCCC TCGAGGACGC 


3840 




CCCGAGTCAC 


TGCGCGGCGG CGACGCCGGA GTGTTCGTCG GCGCCACCGC 


3900 




ALt-GOAC* I AC. 


GGGCCGAGGC 


TTCACGAGGG AGCGGACGGA TACGAGGGGT ACCTGCTCAC 


3960' 


25 






CCTCCGGCCG GATCGCCTAC ACCCTCGGCA CCGGCGGACC 


4020 






l» J.\-GAL.At-L.lj 


CGTGCTCCTC GTCCCTCGTG GCGCTGCACC TGGCCGTGCA 


4080 






CUUGG4.GAG 1 


GCGGGCTGGC TCTGGCGGGC GGCGCCACGG TX3ATGTCGGG 


4140 


30 




1 Ia-G IvAiAG 1 


TCTCGCGGCA GCGCGGGCTC GCCCCCGACG GCCGCTGCAT 


4200 








ACUO 1 AvJGGC CTGGTCCGAG GGTGTCGCCG TACTGGCACT 


4260 








GGCGTGCGGG ACACCGGGTG CTGGGCGTGG TGCGGGGCAG 


4320 


35 




V.i«UOAV.V9V> Iv* 


Lv-AOCAACGG CCTGACCGCT CCCAACCGCT CCGCGCAGGA 


4380 




uv*vvv^\j icn 1 v 




lUGCCGACGC CGGCCTCGCG CCGGGTGACG TCGACGCGGT 


4440 


40 


GGAC5GCGCAC 




uvi^vvciuuo CGATCCGATC GAGGCGAGCG CGCTGCTGGC 


4500 


CACGTACGGG 


CGTGAGCGGG 


TGGGCGACCC CTTGTGGCTC GGGTCGCTGA AGTCCAACGT 


4560 




CGGTCACACC 


CAGGCCGCCG 


CGGGGGCCGC GGGTGTGGTC AAGATCCTGC TTGCCCTGGA 


4620 


45 


GCACGGCACG 


CTGCCGCGGA 


CACTTCACGC GGACCGGCCC AGCACGCACG TCGACTGGTC 


4680. 




GTCGGGCACC 


GTCGCCCTGC 


TGGCAGAGGC GCGCCGGTGG CCCCGGCGGT CGGACCGCCC 


4740 




GCGCCGGGCG GCTCTGTCGT CGTTCGGGAT CAGTGGGACG AACGCGCATC TGATCATCGA 


4800 


SO 


GGAGGCGCCG 


GAGTGGGTCG 


AGGACATCGA CGGCGTCGCT GCTCCTGACC GCGGTACCGC 


4860 




GGACGCGGCT 


GCTCCGTCCC 


CGCTCTTGTT GTCCGCGCGG TCGGAGGGGG CGTTGCGGGC 


4920 


55 


GCAGGCGGTG 


CGGTTGGGTG AGTACGTGGA GCGGGTCGGT GCGGATCCGC GGGATGTGGC 


4980 


TTATTCGCTG GCTTCGACGC GGACTCTTTT CGAGCACCGT GCGGTGGTGC CGTGTGGTGG 


5040 
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GCGTGGGGAG CTCGTCGCTG CTCTTGGTCG GTTTOCTGCC GGGAGGGTGT CTGGGGGTGT 
GCGGTCCGGG CGGGCTGTCC CGGGTCGGGT GGGGGTGTTG TTCACGGGTC AGGGTGCGCA 
GTGGGTTGGT ATGGGGCGTG GGTTGTATGC GGGGGGTGGG GTGTTTGCGG AGGTCCTGGA 
TGAGGTCTTG TCGATGGTGG GGGAGGTCGA TCGTCGGTCG TTGCGGGATG TGATGTTCGG 
CGACGTCGAC GTGGACGCGG GTGCCGGGGC TGATGCGGGT GCCGGTGCGG GTGCTGGGGT 
CGCTTCTGGT TCCGGTTCTC TGGCTCGGTT GTTGGGTCGG ACGCACTTTC CTCAGCCTGC 
GCTGTTTGCG TTGGAGGTGG CGTTGTTCCG GGCGTTGGAG GCTCGGGGTG TGGAGGTCTC 
GGTGGTGTTG GGTCATTCGG TGGGGGAGGT GGCTGCTGCG TATCTGGCGG GGGTGTTGTC 
GTTGGGTGAT GCGGTGCGGT TGGTGGTGGC GCGGGGTGGG TTGATGGGTC GGTTGCCGGT 
GGGTGGGGGG ATGTGGTCGG TGGGGGCGTC GGAGTCGGTG GTGCGGGGGG TTGTTGAGGG 
GTTGGGGGAG TGGGTGTCGG TTGCGGCGGT GAATGGGCCG CGGTCGGTGG TCTTGTCGGG 
TGATGTGGGT GTGCTGGAGT CGGTGGTTGC CTCGCTGATG GGGGATGGGG TGGAGTCCCG 
GCGGTTGGAT GTGTCGCATG GGTTTCATTC GGTGTTGATG GAGCCGG7GT TGGGGGAGTT 
CCGGGGGGTT GTGGAGTCGT TGGAGTTCGG TCGGGTGCGG CCGGGTGTGG TCGTGGTGTC 
GGGTGTGTCG GGTGGGGTGG TGGGTTCGGG GGAGTTGGGG GATCCGGGGT ATTGGGTGCG 
TCATGCGCGG GAGGCGGTGC GTTTCGCGGA TGGGGTGGGG GTGGTGCGTG GTCTCGGTGT 
GGGGACGTTG GTGGAGGTGG GTCCGCATGG GGTGCTGACG GGGATGGCGG GTGAGTGCCT 
GGGGGCCGGT GATGATGTGG TGGTGGTGCC GGCGATGCGG CGGGGCCGTG CGGAGCGGGA 
GGTGTTCGAG GCGGCGCTGG CGACGGTGTT CACCCGGGAC GCCGGCCTGG ACGCCACGGC 
ACTCCACACC GGGAGCACCG GCCGGCGCAT CGACCTCCCC ACCTACCCCT TCCAACGCGA 
CCGCTACTGG CTGGACCCCG TTCGCACCGC CGTGACCGGC GTCGAGCCCG CCGGCTCGCC 
GGCGGACGCT CGGGCCACTG AGCGGGGACG GTCGACGACG GCCGGGATCC GCTACCGCGT 
CGCTTGGCAG CCGGCCGTCG TCGACCGCGG CAACCCCGGG CCTGCCGGTC ATGTCCTGCT 
TCTGGCCCCG GACGAGGACA CGGCCGACTC CGGACTCGCC CCCGCGATCG CACGTGAACT 
CGCCGTGCGC GGGGCCGAGG TCCACACCGT CGCCGTGCCG GTCGGTACAG GCCGGGAGGC 
AGCCGGGGAC CTCTTGCGGG CCGCCGGTGA CGGTGCCGCC CGCAGCACCC GAGTTCTCTG 
GCTCGCCCCG GCCGAGCCGG ACGCGGCCGA CGCCGTCGCC CTCGTCCAGG CGCTGGGCGA 
GGCGGTACCC GAAGCCCCGC TCTGGATCAC CACCCGTGAG GCGGCGGCCG TGCGGCCGGA 
CGAGACCCCT TCCGTCGGGG GCGCTCAGCT GTGGGGACTC GGACAGGTCG CCGCGCTCGA 
ACTGGGGCGG CGCTGGGGCG GCTTGGCGGA CCTGCCCGGG AGTGCGTCGC CCGCGGTCCT 
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CCGTACGTTC GTCGGGGCGC TGCTCGCCGG 
5 CGGCGTCCAT OTCCGCCGTG TGGTTCCCGC 

CGTCACCACG GCCCCCGCCA CCGCCGTCGG 
GGTCGTGCCC GACGACCGGT GGTCCTCCGG 
70 CCTCGGTOCG CAGGTCGCCC GCAGGCTCGC 

GGGCCGGCGC GGCGCGGCCG GCCCCGGAGT 
CGGTTCCGAA GTGGCCGTCG AGGCCTGCGA 
15 CCTCGCGGGC CTCCCCGAGG AGCGGCCCCT 

CGACGACGGT GTGCTCGACT CGCTCACCTC 
GGTCACCGCC GCCCGTCACC TGGACGAGCT 

20 

GCTCTTCTCC TCCATCGTCG GCGTGTGGGG 
CAACGCCGCG CTCGACGCCC TGGCGCAGCG 

25 GATCGCCTGG GGGCCG1GGG CCGGTGCCGG 

CGAACGGGAC GGCGTCACGG CCCTGGACCC 
GGTGGGCGCC GGCGGGACCT CTGCCGCAGG 

30 CGTCGCCGAC GTGGACTGGG AGACCTTCGT 

GCTCTTCGAC GGCGTCTCCG CCGCCCGTTC 
TGCCGCTCTC ACCCCAGGGA CGCGGCCGGG 

35 

CGGGGGCGAG OGCCGGCCGT GGCTCTCCGT 
TCTGCTCACG CTTGTCCGCT CGGAGGCCGC 
GGTCGACCCG GAGCTGGCCT TCCGGTCCGC 

40 

GCGTAACCGC CTGACCGCTG CCACCGGCCT 
CCCGACCCCC CTCTCGCTCG CCTCCCACCT 

45 GGCGGAGCCG GCAGCGGCCG CCCCCACGCC 

GATCGTGGGC ATGGCGTGCC GTTACCCGGG 
CCTGGTGGCC GGTGACGGGC ACACGCTCTC 

50 CGAGGGGCTG TACGACCCGG AGCCGGGGGT 

GTTCCTGCGT TCCGCGGCCG AGTTCGACGC 
CACGGCCATC GACCCGCAGC AGCGGTTGCT 

55 

GGCCGGCATC GTTCCGGACT CGCTGCGCGG 



GGGAGAGAAC CAGTTCGCGG TACGGCCCTC 6900 
GCCCGTCCCC GTCCCGGCCT CCGCTCGCAC 6960 
CGAGGACGCA CGGAACGACA CCTCGGACGT 7020 
CACCGTACTG ATCACCGGGG GCACCGGTGC 7080 
CCGGTCGGGC GCCGCGCGTC TGCTCCTGGT 7140 
GGGCGAACTC GTCGAGGAGC TGACGGCGCT 7200. 
CGTCGCCGAC CGGGACGCAC TGGCCGCGCT 7260 
CGTCGCCGTA CTGCACGCGG CAGGTGTGCT 7320 
CGACCGGGTG GACGCCGTAC TGCGGGACAA 7380 
GACCGCGGAC CTTCCGCTCG ACGCCTTCGT 7440 

CAACGGAGGG CAGGCCGTCT ACGCGGCCGC 7500 

GCGCCGGGCC AGGGGAGCCC GTGCCGCCTC 7560 

AATGGCCTCC GGAACGGCGG CGAAGTCCTT 7620 

CGAGCGCGCG CTCGACGTCC TCGACGACGT 7680 

GACGCACGCG GCCGGCGAGA GCTCCCTGCT 7740 

CGGGCGTTCG GTCACCCGCC GTACCTGGTC 7800 

GGCGCGTGCC GGCCATGCCG CGGACGACCG 7860 

CGACGGCGCA CCGGGCGGGA GCGGACAGGA 7920 

CGGCCCCTCG CCGGCGGAAC GCCGTCGTGC 7980 

CGGGATCCTG CGCCACGCCT CGGCCGACGC 8040 

CGGGTTCGAC TCCCTCACCG TTCTCGAACT 8100 

GAACCTGCCG AACACGCTGC TCTTCGACCA 8160 

GCACGACGAA CTCTTCGGTC CCGACAGCGA 8220 

GGTCATGGCC GACGAGCGTG AGCCGATCGC 8280 

CGGTGTGGCG TCGCCGGACG ACCTGTGGGA 8340 

CCCGTTCCCG GCCGACCGTG GCTGGGACGT 8400 

GCCGGGCAAG AGCTATGTAC GGGAAGGCGG 8460 

GGAGTTCTTC GGGATATCGC CGCGCGAGGC 8520 

GCTGGAGACG TCGTGGGAGG CGCTCGAGCG 8580 

CACCCGGACC GGTGTCTTCA GCGGCATCTC 8640 



114 



EP 0 791 655 A2 



CCAGCAGGAC TACGCGACCC AGCTGGGGGA 
CACGGGGACC CTCGGCAGTG TGATCTCCGC 

5 

GCCGGCGCTG ACGGTGGACA CGGCGTGTTC 
GCAGTCGTTG CGGCGGGGTG AGTGTGATCT 

10 GACGCCGACG GTGTTCGTGG AGTTCTCGCG 

CAAGGCGTTC GCGGACGCTG CGGACGGGAC 
GGTGGAGCGG CTTTCCGACG CGCGCCGCAA 

15 CAGTGCGGTC AATCAGGACG GTGCGAGCAA 

GCAGCGGGTG ATCCGTGAGG CGCTGGCTGA 
GGTGGAGGCG CACGGTACGG GGACGGCGCT 

20 

GGCCACGTAC GGGCGGGAGC GGGTCGGCGA 
CATCGGGCAT GCGCAGGCGG CTGCGGGTGT 
GCGGCATGGG TCGTTGCCGC GGACGCTGCA 

25 

GGCTTCGGGT GCGGTGGAGC TGCTGACCGA 
GGTGCGGCGG GCCGCGGTGT CGGCGTTCGG 

30 GGAGGAAGCG CCGGCGGAGG CCGGGAGCGA 

CGACGCGGTG ACGGGTCCGT TGTCGTGGGT 
GGCGCAGGCG GTGCGGTTGC GTGAGTGTGT 

35 GGCGGGGTCG TTCGTGGTGT CGCGTGCGTC 

GGGGCGTGAG GAGTTGCTGG CGGGTCTGGA 
GTCTTCGGGG GCCGGTGCTG TGGTGCGGGG 

40 

GTTGTTCACG GGTCAGGGTG CGCAGTGGGT 
TGGGGTGTTT GCGGAGCTGC TGGATGAGGT 
GTCGTTGCGG GATGTGATGT TCGCGGATGC 

45 

GACGGAGTTT GCTCAGCCTG CGTTCTTTGC 
GGCTCGGGGT GTGGAGGTGT CGGTGGTGTT 
50 GTATGTGGCG GGGGTGTTGT CGTTGGGTGA 

GTTGATGGGT GGGTTGCCGG TCGGTGGGGG 
GGTGCGGGGG GTTGTTGAGG GGTTGGGGGA 

55 

GCGGTCGGTG GTGTTGTCGG GTGATGTGGG 



CGCCGCCGAC ACCTACGGCG GGCATGTGCT 8700 

TCGGGTTGCC TATGCGTTCG CGTTCSCAGGG 8760 

GTCGTCGTTG GTGGCGTTGC ATCTOGCGGT 8820 

GGCGTTGGCC GGTGGGGTGA CGGTGATGGC 8880 

GCAGCGGGGG CTGGCGGCGG ACGGGCGGTG 8940 

GGCGTCGGCG GAGGGTGTGG GTGTGCTGCT 9000 

CGGTCATCGG GTGCTGGCGG TGGTGCGGGG 90 SO 

TGGGCTGACG GCGCCGAGTG GTCCGGCGCA 9120 

TGCGGGGCTG GTGCCCGCCG ACGTGGATGT 9180 

GGGTGATCCG ATCGAGGCGG GTGCGCTGCT 9240 

TCCGTTGTGG CTCGGGTCGT TGAAGTCGAA 9300 

GGGTGGTGTG ATCAAGGTCG TGCAGGGGAT 9360 

TGTGGATGCG CCGTCGTCGA AGGTGGAGTG 9420 

GACCCGGTCG TGGCCGCGGC GGGTGGAGCG 9480 

GGTGAGCGGG ACCAACGCCC ATGTCGTCCT 9540 

GCACGGGGAC GGCCCTGAAC CTGAGCGGCC 9600 

GCTTTCTGCG CGGTCGGAGG GGGCGTTGCG 9660 

GGAGCGGGTG GGTGCGGATC CGCGGGATCT 9720 

GTTCGGTGAG CGTGCGGTGG TGGTGGGCCG 9780 

TGTGGTGGCT GCCGGGGCTC CTGTGGGTGT 9840 

GAGTGCGGTG CGGGGTCGTG GGGTGGGGGT 9900 

TCGTATGGGG CGTGGGTTGT ATGCGGGGGG 9960* 

GTTGTCGGTG GTGGGGGAGG TGGATOGTCG 10020 

TGACTCGGTT TTGGGTGGGT TGTTGGGTCG 10080 

GTTGGAGGTG GCGTTGTTCC GGGCGTTOGA 10140 

GGGTCATTCG GTGGGGGAGG TGGCTGCTGC 10200 

TGCGGTGCGG TTGGTGGTGG CGCGGGGTGG 10260 

GATGTGGTCG GTCGGGGCGT CGCAGTCGGT 10320 

GTGGGTGTCG GTTGCGGCGG TGAATGGGCC 10380 

TGTGCTGGAG TCGGTGGTTG TCACGCTGAT 10440 
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GGGGGATGGG GTGGAGTGCC GGCGGTTGGA 
5 GGAGCCGGTG TTQGGGGAGT TCCGGGGGGT 

GCCGGGTGTG GTGGTGGTCT CGGGTGTGTC 
GGATCCGGGG TATTGGGTGC GTCATGCGCG 
70 GGTGGTGCGT GGTCTGGGTG TGGGGACGTT 

GGGGATGGCG GGTCAGTGCC TGGAGGCCGG 
GCGGGGCCGT CCGGAGCGGG AGGTGTTCGA 
15 CGCCGGCCTC GACGCCACGA CACTCCACAC 

CACCTACCCC TTCCAACACA ACCGCTACTG 
CACCTCGGCA GCCGCGCGCT TCGGCCTGGA 

20 

CACGCCGATA GCCGGCTCCG GCGCGCTGCT 
CCCGTGGCTG GCCGACCACG CCATCTCCGG 
CGACCTGCTG CTGCGGGCGG TCGAGGAGGT 

25 

CCATGAGCCC CTGCTCCTCC CCGAGCGAGG 
GGCCGACGAG CAGGGACGGC GTGCCGTGGC 

30 GGACGGTGAG GAACAGGAGT GGACCCGGCA 

GGCCGTTCCG GACATGGGCT GGGCCGCCGG 
CGACGTCGAG GAGCTGTACG ACGCGTTCGC 

35 CACCGCACTG TCCGGCGTGT GGCGTCTCGG 

CGCGGGGGGC GCGGGCACGA CCGGTGACGG 
GGCCCTCCAC CCGTGGCGCG CCGGCGGGCT 

40 

GCCGTTCTCC TGGCAGGGCA TCGCGCTCCA 
ACTGGCCCCT GCGGCCGGCG GCACCGAGTC 

45 GGGCACCCCG GTCCTCACCC TCGACGCACT 

CGACGCGCCG CAACCGC1CT ACCGCGTCGA 
CTCCGGCGCC CAGGGCTGGA CGGTGCTCGG 

50 CGCCGCCCAT GCGGACCTCA CCGCCCTGCG 

GCCCCGGCTG GTGGTCGTGT CGCCGGTGGA 
GGACGCCGAG GCTCGGGCCC GTGCGGGTCA 

55 

CCTCGGGCGC GGCCTCACCC TGGTCCGGGA 



TGTGTCGCAT GGGTTTCATT CGGTGTTGAT 10500 

TGTGGAGTCG TTGGAGTTCG GTCGGGTGCG 10560 

GGGTGGGGTG GTGGGTTCGG GGGAGTTGGG 10620 

GGAGGCGGTG CGTTTCGCGG ATGGGGTGGG 10680 

GGTGGAGGTG GGTCCGCATG GGGTGCTGAC 10740 

TGATGATGTG GTGGTGGTGC CGGCGATGCG 10800 

GGCGGCGCTG GCGACGGTGT TCACCCGGGA 10860 

CGGGAGCACC GGCCGACGCA TCGACCTCCC 10920 

GGCAACCGGC TCAGTGACCG GTGCGACCGG 10980 

GTGGAAGGAC CACCCCTTCC TCAGCGGCGC 11040 

CCTCACCGGC AGGGTGGGGC TCGCTGCCCA 11100 

CACGGTGCTG CTCCCCGGAA CGGCGATCGC 11160 

CGGCGCCGGA GGGGTCGAGG AACTGACGCT 11220 

CGGCCTGCAC GTCCAGGTGC TGGTCGAGGC 11280 

AGTCGCCGCA CGCCCGGAGG GCCCTGGGCG 11340 

CGCGGAAGGC GTGCTCACCT CCACCGAGAC 11400 

GGCCTGGCCG CCXXTCCGGTG CCGAGCCGAT 11460 

CGCGGACGGC TACGGCTACG GCCCGGCCTT 11520 

CGACGAACTC TTCGCCGAGG TCCGGCGGCC 11580 

TTTCGGCGTC CACCCCGCAC TCTTCGATGC 11640 

GCTGCCCGAC ACGGGCGGCA CCACCTGGGC 11700 

CACCACCGGA GCCGAGACGC TCCGCGTCAG 11760 

GGCCTTCTCC GTACAGGCCG CCGACCCGGC 11820 

GCTGCTCCGC CCGGTGACCC TGGGGAGGGC 11880 

CTGGCAGCCG GTCGGCCAGG GGACCGAGGC 11940 

GCAGGCCGCG GCCGAGACGG TCGCGCAGCC 12000 

TACGGCTGTG GCCGCGGCGG GAACACCCGT 12060 

CACCCGGCTG GACGAGGGGC CGGTGCTGGC 12120 

CGGCTGGGAC GACGATCCCC TACGTGTCGC 12180 

GTGGGTCGAG GACGAACGGT TGGCGGACTC 12240 
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CCGGCTCGTC GTCCTCACCC GTGGCGCGGT 
GACAGGTGCG GCCCTGTGGG GGCTGCTCCG 

5 

CACCCTCATC GACGTGGACG ATTCCCCCGA 
ATCGGCCGAG CGACAACTCG CCCTGCGGAC 

10 GATGGCCACC CGGCCGGCGG AGACCACTCC 

GACACAGGTC ACCGCGCCCG CTCCCGACGA 
GGGCACCGTA CTGATCACCG GCGGCACCGG 

75 CGCGCGCCGG TACGGCGTAC GCCACATGCT 

CGAGGCCGGT CCCCTGGAAC GGGAACTCGC 
ATGCGACCTC ACCGACATCG AGGCCGTACG 

20 

CCCGCTGACC GGTGTGGTGC ACACCGCCGG 
GACCCGGCAA CGCCTCGACA CCGTGCTGCG 
CGAGGCGACC CTCGACCGGC CGCTGCGCGC 

25 

CCTGGGCCGC CCCGGGCAGG CCTCCTACGC 
GGGAGCCCGC CGCGCGGCCG GACTGCCCGC 

30 GCAGACGGGC ATGGCAGGAG GCCTCGACGA 

CATCGCCGCG ATGCCTCCGG AGCAGGGGCT 
CCGGGACGGA CCCGCCGTCC TCGTCCCCCT 

35 GGCGAAGGAG CGCGGCGCGG CCACGATGTC 

CCTGCGCCGC AGCGGTGGAG CCGGCGCCCC 
GGACCCCGGT GCGGGACGCC TCGCAGGGAT 

40 

GGCCGTCCTT GAGCTGGTCA CCGAACAGGT 
GGAGATCGAG CCCGAACGAC CCTTCCGGGA 
GCTCCGCAAC CGGCTCAGCC GTCTGGTCGG 

45 

CCACCCCACG CCGAAGGACA TGGCGCAGCA 
AGCCTCGCCC GCGGACGCAG CGCTGGAAGG 
50 GCTGGGCACG GGCGACGCCC GCCGGGCCGA 

CGCGCTCGAC CCACCTOGGC GGACGGGCAC 
TGGCGCGGAA CCGACCGTGA CGGACCGGCT 

55 

CTTCCTGGAC GAGCAGCTGT GACCACACCG 



GGCGGCCGGT CCCGGCGATG TGCCGGACCT 12300 

CTCCGCGCAG TCGCACTATC CGGACCGCTT 12360 

GTCCCGTGCG GCTCTGCCCC GGGCTCTGGG 12420 

GGGCGACGTG CTGGCGCCGG CCCTGGTCCC 12480 

AGCGACGGCG GTCGCCTCGG CGACAACACA 12540 

CCCGGCTGCG GATCCCGTCT TCGACCCGGC 12600 

CGCCCTGGGA CGGCGTGTCG CCTCGCACCT 12660 

TCTGGTCAGC AGGCGTGGAC CGGACGCCCC 12720 

CGGTCTCGGA GTCACCGCCA CCTTCCTGGC 12780 

GAAGGCCGTC GCCGCGGTGC CGTCGGACCA 12840 

CGTGCTGGAC GACGGCGCCC TGACCGGCCT 12900 

GCCCAAGGCC GACGCCGTGC GGAACCTCCA 12960 

GTTCGTCCTG TTCTCCGCCG CCGCCGGACT 13020 

CGCCGCCAAC GCGGTCCTCG ACGCGCTCGC 13080 

AGTGTCCCTG GCGTGGGGCC TGTGGGACGA 13140 

GATGGCCCTC CGCGTGCTGC GCCGGGACGG 13200 

CGAACTGCTC GACCTGGCCC TGACCGGACA 13260 

CCTCCTCGAC GGCGCGGCCC TGCGCCGCAC 13320 

CCCCTTGCTG CGCGCCCTGC TGCCCGCCGC 13380 

CGCGGCGGCC GACCGGCACG GCAAGGAGGC 13440 

GGTGGCACTC GAAGCGGCGG AGCGTTCCGC 13500 

CGCCGAGGTC CTCGGCTACG CGTCGGCCGC 13560 

GATCGGCGTC GACTCCCTGG CGGCGGTGGA 13620 

CCTCCGGTTG CCGACCACGC TGTCCTTCGA 13680 

CATCGACGGG CAGCTCCCCC GCCCGGCCGG 13740 

GATCGGCGAC CTCGC GCGGG CGGTCGCCCT 13800 

GGTACGAGAG CAGCTCGTCG GACTGCTGGC 13860 

CGCCGCACCC GGCGTCCCCT CCGGTGCCGA 13920 

CGACGAGGCG ACCGACGACG AGATCTTCGC 13980 

TGGACCGACC GCATGCCGAG GAGTTGGTGG 14040 
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CAGCAATGAC CGCCGAGAAC GACAAGATCC 
TGCACCGGAC CAAGTCCCGC CTGGCCGAGG 

5 

TCGTGGGCAT GGCGTGCCGT TACCCGGGCG 
TGGTGGCAGC CGGTACGGAC GCGGTCTCCG 

w AGGGGCTGTA CGACCCCGAT CCGGAGGCGG 

TCCTGCACTC GGCGGCCGAG TTCGACGCGG 
CGGCGATGGA TCCGCAGCAG CGGTTGCTGC 

15 CGGGGATCGT CCCCGCGTCG CTGCGCGGCA 

ACGACGACTA CGGGTCGCGG TTCGACTCGG 
ACGGCAGCGC CGGCAGCATC GCGTCCGGTC 

20 CGGCGCTGAC GGTGGACACG GCGTGTTCGT 

AGTCGTTGCG GCGGGGTGAG TGTGATCTGG 
CGCCGACGGT GCTCGTGGAG TTCTCGCGGC 

25 

AGGCGTTCGC GGAGGGTGCG GACGGGACGG 
TGGAGCGGCT CTCCGACGCC CGCCGCAATG 

30 GTGCGGTCAA TCAGGACGGT GCGAGCAACG 

AGCGGGTGAT CCGTGAGGCG CTCGCCGACG 
TCGAGGCGCA CGGCACCGGC ACACCCCTGG 

35 CCACCTATGG CAGTGAGCGC CAGGGCCAAG 

ACATCGGGCA TGCGCAGGCG GCTGCGGGTG 
TGCGGCATGG GTCGTTGCCG CGGACGCTGC 

40 

GGGCTTCGGG TGCGGTGGAG CTGCTGACCG 
GGGTGCGGCG GGCCGCGGTG TCGGCGTTCG 
TGGAGGAAGC GCCGGCGGAG GCCGGGAGCG 

45 

CCGACGCGGT GACGGGTCCG TTGTCGTGGG 
GGGCGCAGGC GGTGCGGTTG CGTGAGTGTG 

50 TGGCGGGGTC GTTGGTGGTG TCGCGTGCGT 

GGGGGOGTGA GGAGTTGCTG GCGGGTCTGG 
TGTCCGGGGG CGTGTCTTCG GGGGCCGGTG 

55 GTGGGGTGGG GGTGTTGTTC ACGGGTCAGG 



GCAGCTACCT GAAGCGTGCC ACCGCCGAAC 14100 

TCGAGTCGGC GAGCCGCGAG CCGATCGCGA 14160 

GTGTGGCGTC GCCGGACGAC CTGTGGGACC 14220 

CGTTCCCCGT CGACCGTGGC TGGGACGTCG 14280 

TGGGGCGTAG TTACGTGCGC GACGGCGGGT 14340 

AGTTCTTCGG GATCTCGCCC CGTCAGGCGG 14400 

TGGAGACGTC GTGGGAGGCG CTGGAGCGGG 14460 

CCCGTACCGG CGTCTTCACC GGCGTCATGT 14520 

CTCCGCCGGA GTACGAGGGC TACCTCGTGA 14580 

GGGTTGCCTA TGCGTTGGGG TTGGAGGGGC 14640 

CGTCGTTGGT GGCGTTGCAT CTGGCGGTGC 14700 

CGTTGGCCGG TGGGGTGACG GTGATGGCGA 14760 

AGCGGGGGCT GGCGGCGGAC GGGCGGTGCA 14820 

CGTGGGCCGA GGGTGTGGGC GTGCTGCTGG 14880 

GCCATCGGGT GCTGGCGGTG GTGCGGGGCA 14940 

GGCTGACGGC GCCGAG1GGT CCTGCGCAGC 15000 

CGGGGCTGAC GCCCGCCGAC GTCGACGCGG 15060 

GCGACCCCAT CGAGGCGGGT GCGTTGCTGG 15120 

GTCCGTTGTC GTTGGGGTCG TTGAAGTCGA 15180 

TGGGTGGCGT GATCAAGGTG GTCCAGGCGA 15240" 

ATGTGGATGC GCCGTCGTCG AAGGTGGAGT 15300 

AGACCCGGTC GTGGCCGCGG CGGGTGGAGC 15360 

GGGTGAGCGG GACCAACGCC CATGTGGTCC 15420 

AGCACGGGGA CGGCCCTGAA CCCGAGCGGC 15480 

TGCTTTCTGC GCGGTCGGAG GGGGCGTTGC 15540 

TGGAGCGGGT GGGTGCGGAT CCGCGGGATG 15600 

CGTTCGGTGA GCGTGCGGTC GTGGTGGGCC 15660 

ATGTGGTGGC TGCCGGGGCT CCTGTCGGTG 15720 

CTGTGGTGCG GGGGAGTGCG GTGCGGGCTC 15780 

GTGCGCAGTG GGTTGGTATG GGGCGTGGGT 15840 
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TGTATGCGGG GGGTGGGGTG TTTCCGGAGG 
AGGTGGGGCC TTCGTCGTTG CGGGATGTGA 

5 

CCGGGGCTGA TGCGGGTGTC GGTTCGGGTG 
AGTTTGCTCA GCCTGCGTTG TTTGCGTTGG 

10 GGGGTGTGGA GGTGTCGGTG GTGTTCGGTC 

TGGCCCGGCT GTTGTCGTTG GGTGATCCGG 
TGGGTGGGTT GCCGGTCGGT GGGGGGATGT 

15 GGGGGGTTGT TGAGGGGTTG GGGGAGTGGG 

CGGTCGTGTT GTCGGGTGAT GTGGGTGTCC 
ATCGGGTGGA GTGCCGGCGG TTGGATGTCT 

20 

CGGTGTTGGG GGAGTTCCGG GGGGTTGTGG 
GTGTGGTGGT GGTGTCGAGT GTGTCGGGTG 
CGGGGTATTG GGTGCGTCAT GCGCGGGAGG 

25 

TGCGTGGTCT CGGTGTCGGC ACGTTGGTGG 
TGGCGGGTGA GTGCCTGGGG GCCGGTGATG 

30 GCCGTGCGGA GCGGGAGGTG TTCGAGGCGG 

GCCTGGACGC CACGACACTC CACACCGGGA 
ACCCCTTCCA ACACGACCGC TACTGGCTGG 

35 TGTCGGCGGC GGGTCTGCGC GAGGTCGAGC 

CCGGCACCGA CACCGAGGTG TGGACCGGCC 
CCGACCACCT GGTGTGGGAC CGAGGCGTGG 

40 

TCCAGGTGGG AAGCCGGATC GGTCTGCCGC 
TGACCTGGAC GTCGGACCGC CCGCTCCAGG 
CCCCCGGGGG CGCGCGTCAG CTGACCCTCC 

45 

CGTCCTCCCC GAGTCCCGCC TCTCCCCGGC 
GGACCCGGCA TGCCTCAGGG CTGCTCGCCC 
50 AGCTCACCGG CGCCTCGCCC CCCGTCGGCG 

CGCTCVTCGC kGCCGCCGGA GTGCGCTACG 
GGCGTCGAGG CGACGAGGTC TTCGCCGACG 

55 

CTGATCGTTA CGGGGTX3CAC CCCGCCCTGC 



TGCTGGATGA GGTGTTGTCG GTG G TGGGGG 15900 

TGTTCGGCGA CGTCGACCTG GACGCGGGTC 15960 

TTGGTCTGGG TGGGTTGTTG GGTCGGACGG 16020 

AGGTGGCGTT GTTCCGGGCG TTGGAGGCTC 16080 

ATTCGGTGGG GGAGGTGGCT GCTGCGTATG 16140 

TGCGGTTGGT GGTGGCGCGG GGTGGGTTGA 16200 

GGTCGGTGGG GGCGTCGGAG TCGGTGGTGC 16260 

TGTCGGTTGC GGCGGTGAAT GGGCCGCGGT 16320 

TGGAGTCGGT GGTTGCCTCG CTCATGGGGG 16380 

CGCATGGGTT TCATTCGGTG TTGATGGAGC 16440 

AGTCGTTGGA GTTCGGTCGG GTGCGGCCGG 16500 

GGGTGGTGGG TTCGGGGGAG TTGGGGGATC 16560 

CGGTGCGTTT CGCGGATGGG GTGGGGGTGG 16620 

AGGTGGGTCC GCATGGGGTG CTGACGGGGA 16680 

ATGTGGTGGT GGTGCCGGCG ATCCGGCGGG 16740 

CGCTGGCGAC GGTGTTCACC CGGGACGCCG 16800 

GCACCGGCCG ACGCATCGAC CTCCCCACCT 16860 

CCGCCCCGTC CCGGCCCAGG ACGGACGGGC 16920 

ACCCCCTGCT CACCGCCGCC GTGGAACTGC 16980 

GCATATCCGC TGCCGACCTG CCCTGGCTCG 17040 

TGCCGGGGAC CGCGCTGCTG GAGACGGTGC 17100 

GCGTCGCCGA ACTGGTCCTG GAGACGCCGC 17160 

TCCGGATCGT CGTGACCGCT GCCGCCACCG 17220 

ACTCGCGGCC CGAGCCCGTG GCCGCCTCCT 17280 

ACCTCACGGC GCAGGAGAGC GACGACGACT 17340 

CGGCTGCCGG CCTCGCCGAC GACTTCGCCG 17400 

CCGAGCCCCT CGACCTCGCC GGTCAGTACC 17460 

AAGGCGCCTT CCGAGGGCTG CGCGCGGCAT 17520 

TACGGCTGCC CGACGCGCAC GCGGTCGACG 17580 

TCGACGCGGT GCTCCACCCG ATCGCGTCGC 17640 



119 



EP 0 791 655 A2 



10 



15 



20 



25 



TCGACCCGCT GGGCGACGGC GGGCACGGTC TGCTGCCGTT CTCCTGGACC GACGTACAGG 17700 

GACACGGCGC CGGCGGACAC GCCCTCCGGG TACGGGTGGC GGCCGTCGAC GGCCGCGCGG 17760 

TGTCGGTCAC CGCGGCCGAC CACGCGGGCA ACCCGGTGTT ATCCGCCCGG TCCCTGGCAC 17820 

TGCGTCGTAT CACCGCGGAC CGGCTTCCCG CCGCGCCCGT CGCCCCTCTC TACCGCGTGG 17880 

ACTGGCTGCC GTTCCCGGGT CCGGTGCCCG TATCCGCGGG CGGCCGCTGG GCGGTCGTCG 17940 

GACCCGAGGC CGAAGCCACG GCTGCCGGAC TGCGTGCGGT GGGCCTCGAC GTGCGTACCC 18000 

ATGCGCTCCC CCTCGGAGAG CCCCTGCCTC CGCAGGCCGG TACCGACGCG GAGGTGATCA 18060 

TCCTCGACCT GACCACCACC GCAGCCGGCC GTACGGCGTC GGACGGGGGG CGGCTCAGTC 18120 

TCCTCGACGA GGTGCGTGCG ACGGTGCGCC GGACCCTCGA AGCCGTACAG GCCCGCCTCG 18180 

CCGACACCGA AACGGCCCCC GACGTCGACG TCCGTACGGC CGCGCGCCCC CGCACAGCCG 18240 

CCCGTACAAG CCCCCGCGTG GACACCCGCA CGGGAGCCCG CACCGCTGAC GGCCCCCGGC 18300 

TCGTCGTCCT GACCCGGGGC GCGGCCGGAC CCGAGGGAGG CGCGGCCGAT CCCGCGGGTG 18360 

CCGCTGTCTG GGGGCTCGTC CGGGTCGCCC AGGCCGAACA GCCCGGCCGC TTCACCCTGG 18420 

TGGACGTCGA CGGCACCCAG GCGTCGCTGC GGGCCCTGCC CGGTCTGCTG GCCACGGATG 18480 

CCGGCCAGTC GGCCGTGCGC GACGGACGTC TCACCGTCCC GCGCCTCGTC CCGGTGGCCG 18540 

30 ACCCCGTCCC CCACGGCGGC GGCACGGCGG CCGACGGGAC GGGTGCCGGC GAGCCGTCCG 18600 

CGACCCTGGA CCCCGAAGGC ACCGTGCTGA TCACCGGCGG CACCGGAGCA CTGGCCGCGG 18660 
3 AAACCGCCCG GCACCTGGTC GACCGGCACA AGGTGCGCCA TCTCCTGCTG GTGGGCAGGC 18720 

GCGGTCCCGA CGCACCCGGC GTCGATCGAC TCGTCGCCGA GTTGACCGAG TCGGGTGCCG 18780 

AGGTCGCCGT ACGGGCCTGT GACGTCACGG ACCGCGACGC CCTGCGCCGC CTGCTCGACG 18840 

CACTCCCCGA CGAACACCCG CTGACCTGCG TGGTGCACAC CGCCGGGGTG CTCGACGACG 18900 

GCGTGCTCTC CGCCCAGACG GCCGAGCGGA TCGACACGGT GCTCCGGCCC AAGGCCGACG 18960 

CCGCCGTCCA CCTGGACGAG CTGACCCGGG AGATCGGACG GGTGCCCCTG GTGCTGTACT 19020 

^ CCTCGGTCTC GGCCACCCTG GGCAGCGCGG GGCAGGCCGG GTACGCGGCG GCCAACGCCT 19080 

TCATGGACGC GCTGGCCGCC CGGCGGTGCG CCGCCGGGCA CCCCGCGCTG TCGCTCGGCT 19140 

GGGGCTGGTG GTCCGGGGTG GGTCTCGCCA CCGGACTGGA CGGAGCGGAC GCGGCGCGGG 19200 

SO TCAGGCGCTC GGGTCTCGCC CCGCTCGACG CCGGCGCCGC ACTGGACCTG CTCGACCGGG 19260 

CGCTGACCCG GCCCGAGCCG GCCCTGCTGC CCGTGCGGCT CGACCTGCGC GCCGCGGCCG 19320 

GTGCCACCGC TCTCCCGGAG GTCCTGCGTG ACCTGGCCGG CGTACCGGCG GACGCCCGCA 19380 

GCACGCCCGG GGCCGCGGCG GGCACCGGGG ACGAGGACGG TGCCGTGCGC CCTGCCCCCG 19440 



35 



40 



55 
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CCCCGGCCGA CGCCGCCGGG ACGCTGGCCG CGCGGCTCGC GGGACGTTCC GCACCCGAGC 19500 
GTACGGCTCT CCTGCTCGAC CTGGTGCGGA CCGAGGTCGC GGCGGTGCTC GGACACGGCG 19560 

5 

ACCCCGCCGC GATCGGCGCC GCCCGCACCT TCAAGGACGC CGGATTCGAC TCCCTCACCG 19620 
CTGTCGACCT CCGCAACCGG CTGAACACAC GCACCGGACT GCGGCTGCCC GCGACCCTCG 19680 

10 TCTTCGACCA CCCCACACCG CTCGCCCTCG CCGAACTCCT GCTCGACGGG CTGGAGGCGG 19740 

CCGGTCCAGC GGAACCGGCC GCTGAGGTCC CGGACGAAGC GGCCGGTGCC GAGACCCTGT 19800 
CCGGCGTGAT CGACCGGCTG GAACGCAGCC TCGCCGCGAC CGACGACGGC GACGCCCGGG 19860 

15 TCCGCGCGGC ACGGCGGCTG CGCGGCCTCC TGGACGCGCT CCCCGCCGGT CCCGGTGCCG 19920 

CGTCCGGTCC GGATGCCGGA GAGCACGCCC CCGGTCGCGG CGACGTGGTG ATCGACCGGC 19980 
TCAGGTCGGC CTCCGACGAC GACTTGTTCG ACCTGCTCGA CAGCGACTTC CAGTGAGCCG 20040 

20 

GACCGCGCCG CGCGCCGACC GCTGAACCGC TCTTCACCCA GACCCACGAG ACCACGCCTG 20100 

AGGAGAACCG TGTCTGCGAC CAACGAGGAG AAGTTGCGGG AGTACCTGCG GCGCGCGATG 20160 

GCCGACCTGC ACAGCGCACG AGAGCGGTTG CGCGAGGTCG AGTCGGCGAG CCGTGAGCCG 20220 

25 

ATCGCGATCG TGGGCATGGC GTCCCGTTAC CCGGGCGGTG TOGCGTCGCC GGAGGAGCTG 20280 

TGGGACCTGG TGGCCGCCGG TACGGACGCG ATCTCCCCGT TCCCCGTCGA CCGCGGCTGG 20340 

30 GACGCCGAGG GTCTGTACGA CCCGGAGCCG GGGGTGCCGG GCAAGAGCTA CGTGCGCGAG 20400 

GGCGGGTTCC TGCACTCGGC GGCCGAGTTC GACGCGGAGT TCTTCGGGAT CTCGCCGCGT 20460 

GAGGCGGCGG CGATGGATCC GCAGCAGCGG TTGCTGCTGG ACACGTCGTG GGAGGCGCTG 20520 

35 GAGCGGGCCG GGATCGTCCC CGCGTCGCTG CGCGGCACCC GTACCGGCGT CTTCACCGGC 20580 

GTCATGTACC ACGACTACGG CAGCCACCAG GTCGGCACCG CCGCCGATCC CAGTGGACAG 20640 

CTCGGCCTCG GCACCGCGGG GAGCGTCGCC TCGGGCCGGG TGGCGTACAC CCTCGGTCTA 20700 

40 

CAGGGGCCGG CCGTGACCAT GGACACGGCA TGCTCGTCCT CGCTGGTGGC GTTGCACCTG 20760 

GCGGTGCAGT CGTTGCGGCG GGGCGAGTGC GATCTCGCGT TGGCCGGCGG GGCGACGGTC 20820 

45 TTGGCGACGC CCACGGTGTT CGTGGAGTTC TCGCGGCAAC GGGGGCTGGC GGCGGACGGA 20880 

CGGTGCAAGG CGTTCGCGGA GGGCGCCGAC GGCACGGCGT GGGCCGAGGG CGCCGGTGTG 20940 

CTGCTGGTGG AGCGGCTCTC CGACGCCCGC CGCAACGGCC ATCGGGTCCT CGCGGTGGTC 21000 

50 CGGGGCAGCG CGGTCAACCA GGACGGTGCC AGCAACGGCC TCACCGCACC CAGCGGGCCC 21060 

GCCCAGCAGC GGGTGATCCG TGACGCGCTG GCCGACGCGG GGCTGACGCC CGCCGACGTC 21120 

GACGCGGTCG AGGCGCACGG CACCGGCACA CCGCTCGGCG ACCCGATCGA GGCCGGCGCG 21180 

55 CTGATGGCCA CCTACGGCAG TGAACGGGTG GGCGACCCGC TGTGGCTGGG TTCGCTGAAG 21240 
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TCGAACATCG GACACACCCA GGCCGCCGCC 
GCCTTACGGC AGTCCGAGCT GCCGCGCACC 

5 

GAATGGGACG CGGGCGCCGT GC AACTGCTC 
CACAGGCCCC GGCGGGCCGC GGTCTCCGCC 

10 ATCATCGAGG AACCGCCCGC GGCCGGTGAC 

GGCGAGGCGA CCGCGTCCCC CTCCACCGCG 
GGGCCGCTGT CCCCCTCCTC CCCGGCCGTG 

15 GCCCTGCGCG CCCAGGCCGC CCGCCTGCGG 

CCGACCGACA TCGGCCACGC CCTGGCCGCC 
CTGCTCGGCG ACGACGGAGC CCCGGTCGAC 

20 

ACCCCCGACG CCGTCCACGG CACCGCGGCG 
GGCCAGGGTT CCCAGTGGGC CGGGATGGGC 
GCCGCCGAAC TGGACCGCTG CCAGGGCGCG 

25 

GACGTGCTGC GCGGCGCGCC CGCGGCGCCC 
GCCACCTTCG CCGTCATGGT GGGACTCGCC 

30 GCCGCCGTCA TCGGCCACTC CCAGGGCGAG 

TCCCTGGAGG ACGCCGCCCG GATCGTGGCC 
GCCGGGCGGG GCGGCATGGC CTCGGTGGCC 

35 GCCGGCGGCG TCGAGATCGC CGCCGTCAAC 

CCCGGCGCCC TGGAGGCGTT GCTCGTCACG 
ATCGACGTCG ACTACGCGTC CCACTCCCAC 

40 

ACCGTCCTCG GCCCCGTCCG GCCGCGGAGG 
GCGGCGCTCC TCGACACCGC CACCCTGGAC 
CCGGTGCGCT TCGAGCCGAC CGTACGCGCC 

45 

GAGTGCTCCG CGCATCCCGT CCTGACCGTC 
GGCGCGGTCC CGGCCCTCGC TTCGCTGCGC 

50 ACCTCCGCCG CCGAGGCCCA GGTCGTCGGC 

GGCGCCGGCC GGGTGGACCT GCCGACCTAC 
CCCGCCCGGC CCGACTCCGC GGCGACGGCC 

55 GGAGACCGGC TCGGCTACCA CGTCGCGTGG 



GGAGCCGCCG GCGTCATCAA GATGGTGCAG 21300 

CTGCACGTCG ACGCGCCCTC GGCCAAGGTC 21360 

ACCGGCGTCC GGCCATGGCC CCGGCGCGAG 21420 

TTCGGCGTCA GCGGCACCAA CGCCCACGTC 21480 

ACCTCGCCCG CCGGCGACAC CCCTGAGCCG 21540 

GCCGGGCCGT CGTCCCCCTC CGCGGTGGCC 21600 

GTCTGGCCCC TGTCCGCCGA GACCGCCCCC 21660 

GCGCACCTCG AACGCCTCCC CGGCACCTCG 21720 

GAACGCGCCG CCCTCACCCG ACGCGTCGTG 21780 

GCACTCGCCG CCCTCGCCGC CGGCGAGACC 21840 

GACATCCGCC GGGTCGCCTT CGTGTTCCCC 21900 

GCCGAACTGC TGGACACGGC CCCGGCCTTC 21960 

CTCTCCCCGT ACGTGGACTG GAACCTCGCG 22020 

GGCCTCGACC GGGTCGACGT CGTCCAGCCG 22080 

GCGCTGTGGC GCTCCCTCGG GGTCGAACCC 22140 

ATCGCCGCGG CCTGCGTGGC GGGCGCGCTC 22200 

CTGCGCTCCC AGGTCATCGC CCGCGAACTG 22260 

CTGCCCGCGG CGGAGGTCGA GGCCCGCCTG 22320 

GGCCCCGGCT CGACCGTCGT CTGCGGAGAG 22380 

CTGGAGAGCG AAGGCACCCG GGTCCGCCGC 22440 

TACGTCGAGA GCATCCGGGC GGAACTCGCC 22500 

GGCGACGTGC CCTTCTACTC CACCGTCGAG 22560 

GCCGACTACT GGTACCGCAA CCTGCGCCTC 22620 

ATGCTCGACG ACGGCGTCGA CGCGTTCGTG 22680 

GGCGTGCGCC AGACCGTGGA GAGCGCCGGC 22740 

CGCGACGAGG GCGGGCTGCG GCGCTTCCTC 22800 

GTCCCCGTGG ACTGGGCGAC GCTCCGCCCA 22860 

GCCTTCCAGC GCGAACGCCA CTGGGTCGGC 22920 

GCCACGACCG GTGACGACGC CCCGGAGCCC 22980 

AAGGGACTGC GCTCCACCAC CGGCGGCTGG 23040 
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CGCCCCGGCC TGCGCCTGCT GATCGTGCCC 
ACCCTGGAAC AGGCGGTCGC CTCCTTCGGC 

5 

GCACGCACCG GACGCGCCGA GCTGTTCGGC 
GCCGTCACCG GCGTCGTCTC GCTGCTCGGA 

10 GCCGTGCCCG TCGCCGTCAC CGCCACCCTC 

AGCACCGCAC CGCTGTCGAC CGTCACCTGC 
CCGTGCACCG CCGGTGCCCA GCTGTGGGGC 

75 GAGGTGTGGG GCGGCCTCAT CGACCTTCCC 

CTCGCCGGCG TCCTCGCCGA ACCCGGCGGC 
GTCTTCGGCC GCCGGGTCCT GCGGAACCCG 

20 

CGGGGCACCG TCCTCATCGC CGGCGACCTC 
CTCCTCGAGG ACGGCGCGGA CCGCGTGGTG 
GCCGCCGCCG GACTGACCGG CGTCTCCCTC 

25 

GCCGCACTGG CCGCGCTGCT CGACGAGCAC 
CTGGTGCCCC TGGCGCCGCT GCGGGAGACG 

30 GCCAAGACCA CGGCCGCCGG CCACCTGGTC 

CTGGTGCTGT TCTCCTCGGT CTCCGGAGTG 
GCCGCCAGCG CGCACCTCGA CGCGCTGGCC 

35 TTCTCCGTGG CCTGGAGCCC CTGGGCCGGA 

TTCCTCAGCC GGCGCGGGCT GGCTCCCCTC 
CGCATGCTGG AGCGCGGCAG CGCCTGCGGT 

40 

GCCGCCTCCT ACACCTGGGT GCGTCCCGCC 
CGGCTGCGCG CGGCCGAACT CGCCCCGAGC 
CGCGAGCTGA CCGCGCAGTC CGGCCACAAG 

45 

GCACACGCCG CCGCCGTCCT CGGACAGTCC 
TTCCGCGACC TCGGCTTCGA CTCGCTGACC 
50 AGCACCGGGC TCAAACTGCC CACCTCCCTG 

GCCCGGCACC TCGGTGAGGA ACTCCTCGGC 
GACACCCCGG TACGGACGGA CGAGCCCATC 

55 

GGCGGGGTGC AGTCCCCCGA GGACCTGTGG 



ACCGGGGACC AGTACACCGC CCTCGCCGAC 23100 

GGAACGGTCC GCCGCGTCGC CTTCGACCCG 23160 

CTGCTCGAGA CGGAGATCAA CGGCGACACC 23220 

CTGTGCACCG ACGGCAGGCC GGACCACCCC 23280 

GCCCTCGTCC AGGCCCTGGC CGACCTCGGC 23340 

GGCGCGGTCG CCACCGCCCC CGACGAACTG 23400 

CTGGGCCGGG TGGCCGCGCT GGAGCTGCCC 23460 

GCGCGGCCCG ACGCCCGGGT CCTGGACCGT 23520 

GAGGACCAGA TCGCCGTACG GATGGCGGGC 23580 

GCCGACTCCC GGCCCCCGGC CTGGCGCGCC 23640 

ACGACGGTGC CCGGCCGACT GGTCCGGTCC 23700 

CTGGCCGGAC CCGACGCCCC CGCACAGGCC 23760 

GTCCCCGTGC GCTGCGACGT CACCGACCGC 23820 

GCGCCCACCG TCGCCGTGCA CCCCCCGCCC 23880 

GCACCCGGCG ACATCGCCGC CGCCCTCGCC 23940 

GACCTGGCGC CGGCCGCGGG CCTCGACGCG 24000 

TGGGGCGGCG CGGCCCAGGG CGGCTACGCG 24060 

GAACGCGCCC GCGCCGCGGG GGTGCCCGCG 24120 

GGCACGCCCG CCGACGGTGC CGAGGCGGAG 24180 

GACCCCGACC AGGCGGTGCG GACCCTGCGC 24240 

GCGGTCGCCG ACGTCGAGTG GAGCCGGTTC 24300 

GTACTCTTCG ACGACATCCC GGACGTGCAG 24360 

ACCGGAGACT CGACCACCTC CGAACTCGTC 24420 

CGGCACGCCA CCCTGCTGCG GCTGGTGCGC 24480 

TCCGGCGACG CGGTGAGCAG CGCCCGCGCC 24540 

GCCCTCGAAC TGCGCGACCG GCTCAGCACC 24600 

GTCTTCGACC ACTCCAGCCC GGCCGCGCTC 24660 

CGG AACGACA CCGCCGACCG GGCCGGCCCC 24720 

GCCATCATCG GCATGGCCTG CCGGCTGCCC 24780 

GACCTGCTGA CCGGTGGGAC CGACGCCATC 24840 
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ACCCCCTTCC CGACCAACCG GGGATGGGAC 
TCGCCCGGGC ACCACACCTA CGTGCCCGAG 

5 

GACCCCGGCT TCTTCGGCAT CAGCCCCCGC 
CTGATCCTGG AGACGTCCTG GGAGTCCTTC 

10 CGCGGCAGCC GCACCGGGGT CTTCGTCGGC 

CAGGACGGCG ACGAGAACTT CGACGGCTAC 
TCCGGCCGGC TCTCCTACGT CTTCGGACTG 

15 TGCTCGGCCT CCCTGGCCGC ACTGCACCTG 

GACTACGCCC TCGCCGGCGG GGCCACGGTG 
GCCCGTCAGC GAGCGGTGTC GCCGGACGGC 

20 

GGGGTCGGTC TCGCCGAGGG AGCCGGGATG 
AAGAAGGGCC ATCCGGTACT GGCGGTGGTG 
AGCAACGGCC TCACCGCACC CAGCGGGCCC 

25 

GCCGACGCGG GGCTGACGCC CGCCGACGTG 
CCGCTCGGCG ACCCCATCGA GGCCGGCGCG 

30 GACGGCCCGC TGTGGCTGGG TTCGCTGAAG 

GGCGTGGCCG GGGTGATCAA GATGGTGCTG 
CTGCACGCGT CGACGGCGTC GTCCAGGATC 

55 GACGAGGCCA GGCCCTGGCT CCAGCGGGCC 

TTCGGCATCA GCGGCACCAA CGCGCACCTC 
CCCGAACTGC TCGCGCCCGA ACCGGCCGCC 

40 

CACGAGGTGA CCGTGCCCCT GATGATGTCC 
GCGCGGCGCC TGCGCGCCGA CCTGCTCGCC 
TACACCCTCA TCACCACCCG CACCCGGTTC 

45 

TTCACGGAGC TGATCGCGGC CCTCGACGAC 
CTGCGGGGCA CCGCCGGCAC CTCCGACCAG 

50 CAGTGGCCCG AGATGGCCGA CGGGCTGCTG 

GAGACCGCCC GCGCCTGCGA CCTCGCGCTC 
GTACTGCGCC GGGAACCCGG CGCGCCCTCG 

55 CTGTTCACCA TGATGGTCTC GCTCGCCGAG 



AACGAGACCC TCTACGACCC CGACCCCGAC 24900 

GGCGGGTTCC TGCACGACGC GGCCGAGTTC 24960 

GAGGCCCTGG CCATGGACCC GCAGCAGCGG 25020 

GAACGGGCCG GCATCGACCC GGTCGAACTG 25080 

ACCAACGGAC AGCACTACGT GCCGCTCCTC 25140 

ATCGCCACCG GCAACTCCGC CAGCGTGATG 25200 

GAGGGCCCCG CCGTCACCGT CGACACCGCC 25260 

GCGGTGCAGT CACTGCGCCG CGGCGAATGC 25320 

ATGTCCACCC CCGAGATGCT GGTGGAGTTC 25380 

CGCAGCAAGG CGTTCGCGGA GGCGGCCGAC 25440 

CTGCTCGTGG AGCGGCTGTC GGAGGCGCAG 25500 

CGGGGCAGTG CCGTCAACCA GGACGGTGCC 25560 

GCCCAGCAGC GGGTGATACG GGAGGCGCTG 25620 

GACGCGGTCG AGGCGCACGG CACCGGCACG 25680 

CTGCTCGCCA CGTACGGCCG GGACCGGCGC 25740 

TCGAACATCG GGCACACCCA GGCCGCCGCC 25800 

GCGCTGCGCC ACGGCGAGCT GCCGCGCACC 25860 

GATTGGGACG CGGGCGCCGT GGAGTTGCTG 25920 

GAGGGGCCGC GCCGGGCGGG CATCTCCTCG 25980 

GTCATCGAGG AGCCGCCGGA GCCCACCGCG 26040 

GACGGCGACG TCTGGTCCGA GGAGTGGTGG 26100 

GCGCACAACG AAGCCGCCCT GCGCGACCAG 26160 

CACCCCGAGC TGCACCCGGC CGACGTCGGC 26220 

GAGCAGCGGG CCGCCGTCGT CGGCGAGAAC 26280 

CTCGTCGAAG GCCGACCGCA CCCGCTCGTG 26340 

GTCGTGTTCG TCTTCCCCGG CCAGGGCTCG 26400 

GCCCGCTCCA GCGGCTCCGG CTCCTTCCTG 26460 

CGGCCCCACC TCGGCTGGTC CGTCCTGGAC 26520- 

CTCGACCGGG TCGACGTGGT GCAGCCCGTG 26580 

ACGTGGCGTT CGCTGGGCGT CGAACCGGCC 26640 
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GCGGTCGTCG GTCACTCCCA GGGCGAGATC 
CTGGACGACG CGGCGCGCAT CGTCGCCCTG 

5 

AAGGGCGGCA TGGTCGCCGT GACCCTGTCC 
TGGAGCGACC GGCTCGCCGT CGCCGCCGTC 

w GACCCGGACG CCCTGGCGGA GCTGGTCGCC 

CCCATCCCCG GCGTCGACAC CGCCGGGCAC 
CTGCGGAAGG TGCTCGCGCC CGTCGCGCCC 

15 GTCACCGGAG GACTGATCGA CACCGCCGAG 

CGCGAGCCGG TGGAGTTCGA GCAGGCCACC 
TTCCTGGAGT CGAGCCCGCA CCCCATGCTG 

20 GCCGGTTCCC CGGCGGCCGT CCTCGGCACC 

CTGGGCGTCG CCCTCTGCCG CGCCTACACC 
TTCGGCCCCG ACTCACGCCA GGTGGAACTG 

25 

TGGTACAGCC CCGGCCACCG CGGTGACGAC 
CACCCGCTGC TGGGCAGCGG CGTCGAACTG 

30 CGGCTGGGCG CCGACACCAC CCCGTGGCTG 

CTGCCCGGCG CCGCCTTCGC CGACCTGGCG 
CGCGTCGAGG AGCTCACCCT GGCCGCGCCC 

35 CTGCGGCTGA ACGTCGGCGC CCCGGGCACC 

CGCGCCGAGG GCGCCACGGA CTGGACCCTG 
ACGGCCGACG CGCCGGACGC CTCGGCGGCC 

40 

ATCGGCGACT TCTACCAGCG CTTCTCCGAA 
GGACTGGTGA GCGCCCACCG CTGCGGCCCC 
CAGGCGCAGG GCGACGCGGC CCGCTTCGGC 

45 

CAGACCATGA GCCTCGGGGG CTTCTTCCCC 
CTGCGCGGCG TTCGGCTGTA CCGCGCCGGA 

50 GTCTCCGAGG ACGCGGTCCG CATCAGGTGC 

ATCGAGTCCT TCATCATGCG GCCGGTCGAC 
GGCGCCGACG CGCTCTTCCG CATCGCCTGG 

55 ACCGGCGACG GCACCCCTCC CCCGGTGCGC 



GCCGCCGCCT ACGTCGCCGG CGCCCTGACG 26700 

CGCAGCCAGG CGTGGCTGCG GCTGGCCGGC 26760 

GAACGCGACC TGCGTCCCCG CCTGGAGCCC 26820 

AACGGCCCCG AGACCTGCGC CGTCTCCGGG 26880 

GAACTCGGTG CGGAGGGCGT GCACGCCCGC 26940 

TCGCCGCAGG TCGACACGCT GGAGGCCCAC 27000 

CGCACCTCCG ACATCCCGTT CTACTCGACG 27060 

CTGGACGCCG ACTACTGGTA CCGCAACATG 27120 

CGCGCCCTGA TCGCCGACGG CCACGACGTG 27180 

GCCGTCTCCC TCCAGGAGAC GATCAGCGAC 27240 

CTGCGGCGCG GCCAGGGCGG CCCCCGCTGG 27300 

CACGGCCTGG AGATCGACGC CGAGGCCATC 27360 

CCCACGTACC CCTTCCAGCG CGAGCGCTAC 27420 

CCCGCCTCCC TCGGTCTGGA CGCCGTCGAC 27480 

CCGGAGTCCG GTGACCGGAT GTACACCGCA 27540 

GCCGACCACG CGCTGCTGGG GTCGCCGCTG 27600 

CTCTGGGCCG GCCGCCAGGC CGGCACCGGC 27660 

CTGGTGCTGC CCGGCTCCGG GGGTGTCCGG 27720 

GACGACGCCC GCCGCTTCGC CGTGCACGCC 27780 

CACGCCGAGG GGCTGCTCAC CGCGC AGGAC 27840 

ACCCCGCCCC CCGGCGCCGA ACAACTGGAC 27900 

CTCGGTTACG GCTACGGCCC GTTCTTCCGG 27960 

GACATCCACG CGGAGGTCGC GCTGCCCGTC 28020 

ATCCATCCCG CGCTGCTGGA CGCGGCGCTG 28080 

GAGGACGGCC GCGTCCGCAT GCCGTTCGCC 28140 

GCCGACCGGC TGCACGTGCG CGTCTCGCCC 28200 

GCCGACGGCG AGGGACGGCC GGTCGCCGAG 28260 

CCGGGACAGC TCCTGGGCGG CCGCCCGGTC 28320 

CGGGAACTCG CCGCCGGCCC GGGCACCCGT 28380 

TGGGTGCTGG CGGGACCCGA CGCGCTGGGC 28440 
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CTGGCCGAGG CGGCCGACGC CCACCTGCCC GCCGTTCCCG GCCCGGACGG CGCACTGCCG 28500 

TCCCCGACGG GACGCCCGGC GCCCGACGCC GTCGTGTTCG CGGTCCGTGC CGGGACCGGC 28560 

5 

GACGTCGCCG CCGACGCGCA CACCGTGGCC TGCCGGGTGC TGGACCTCGT CCAGCGCCGG 28620 

CTCGCGGCCC CGGAGGGCCC GGACGGCGCC CGCCTGGTGG TGGCCACCCG CGGCGCGGTC 28680 

70 GCCGTACGCG ACGACGCCGA GGTGGACGAC CCGGCCGCGG CCGCCGCGTG GGGCCTGCTG 28740 

CGCTCCGCGC AGGCCGAGGA GCCCGGCCGG TTCCTGCTCG TGGACCTGGA CGACGACCCG 28800 

GCGTCCGCCC GGGCGCTGAC CGACGCCCTC GCCTCCGGCG AACCGCAGAC CGCGGTCCGG 28860 

15 GCCGGGACGG TGTACGTGCC CCGGCTGGAG CGGGCCGCCG ACCGCACGGA CGGGCCGCTC 28920 

ACCCCGCCCG ACGACGGTGC CTGGCGGCTG GGCCGGGGCA CCGACCTCAC CCTCGACGGC 28980 

CTCGCCCTGG TGCCCGCCCC GGACGCCGAG GCGCCGCTGG AGCCCGGCCA GGTGCGCGTC 29040 

20 

GCCGTACGCG CCGCGGGCGT CAACTTCCGC GACGCCCTCA TCGCCCTCGG CATGTACCCG 29100 

GGCGAGGCGG AGATGGGAAC GGAGGGCGCC GGCACCGTCG TCGAGGTCGG CCCCGGCGTC 29160 

ACCGGTGTCG CCGTCGGCGA CCGCGTGCTC GGCCTGTGGG ACGGCGGCCT GGGCCCGCTG 29220 

25 

TGCGTGGCCG ACCACCGGCT GCTCGCCCCC GTCCCGGACG GCTGGTCCTA CGCCCAGGCC 29280 

GCCTCGGTCC CCGCGGTGTT CCTCAGCGCC TACTACGGTC TGGTCACCCT GGCCGGCCTC 29340 

30 AGGCCGGGGG AGCGGGTGCT CGTGCACGCC GCCGCCGGGG GCGTCGGCAT GGCCGCGGTG 29400 

CAGATCGCCC GCCACCTCGG CGCGGAGGTG CTGGCCACCG CGAGCCCCGG CAAGTGGGAC 29460 

GCCCTGCGCG CCATGGGCAT CACCGACGAC CACCTCGCCT CCTCCCGCAC CCTCGACTTC 29520 

35 GCGACCGCCT TCACCGGAGC GGACGGCACG TCCCGCGCGG ACGTCGTCCT GAACTCGCTC 29580 

ACCAAGGAGT TCGTGGACGC CTCCCTCGGG CTGCTCCGTC CGGGCGGCCG GTTCCTGGAG 29640 

CTGGGCAAGA CCGACGTCCG GGACCCCGAG CGGATCGCCG CCGAACACCC CQGGGTGCGC 29700 

40 

TACCGGGCGT TCGACCTCAA CGAGGCCGGA CCCGACGCAC TCGGCCGGCT GCTGCGGGAA 29760 

CTGATGGACC TGTTCGCCGC CGGCGTGCTG CACCCGCTGC CCGTCGTCAC CCACGACGTG 29820 

4s CGCCGGGCCG CGGACGCCCT GCGCACCATC AGCCAGGCCC GGCACACCGG AAAGCTCGTC 29880 

CTGACCATGC CGCCCGCCTG GCACCCGTAC GGCACGGTCC TGGTCACCGG TGGCACCGGC 29940 

GCCCTCGGCA GCCGCATCGC CCGCCACCTG GCGAGCCGGC ACGGCGTCCG CCGGCTGCTG 30000 

50 ATCGCCGCCC GCCGGGGCCC GGACGGCGAG GGCGCCGCGG AGCTGGTCGC CGACCTCGCC 30060 

GCCCTGGGCG CGTCGGCCAC CGTGGTCGCC TGCGACGTCT CCGACGCGGA CGCCGTCCGC 30120 

GGACTGCTCG CCGGCATACC GGCCGATCAC CCGCTGACGG CGGTGGTGCA CAGCACCGGC 30180 

55 

GTCCTCGACG ACGGCGTGCT GCCCGGGCTC ACCCCCGAGC GGATGCGGCG CGTGCTGCGG 30240 
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CCCAAGGTGG AGGCCGCCGT CCACCTGGAC 
TTCGTCCTCT TCTCCTCCAG CGCCGGTCTG 

5 

GCGGCCAACG CCACCCTCGA CGCCCTCGCC 
GTGTCACTCG CCTGGGGTCT GTGGTCCGAC 

w GAGAGCCTCC AGCGGCGCTT CGCCCGCAGC 

GCCGCGCTGT TCGACGCCGC CCTGCGGGTC 
GACCCGGCCG CGCTGCGCGC CACCGGAAGC 

15 TCCGCCCCGG CGACCGGGTC CGCGGCCCCG 

GGGACCGTCG GCGAGCCGCT CGCCGAGCGG 
GACCGGCTGC TCGGCCTGGT CGGCGAACAC 

20 

GCCGAGGTCC GGCCCGACCG GCCGTTCCGC 
GAACTGCGCA ACCGGATGGC GGCGGTCACC 
GACCACCCCA CCCCCGCCGC GCTGTCCTCG 

25 

CCGGTCACCA CCACACCGCT GCTGTCCGAA 
CTCACCCCCG AGCACCTCGC GGAGCTCGCC 

30 CTGCGCCTGG ACGCCCTGGC CGACCGCTGG 

GACGACGACA TCACCGACGT GCTGAGCAGC 
GACGAGCGGT ACGGCACGTC GTGACCGCCG 

35 GCACCACCAT GGCGAACGAA GAGAAGCTGC 

TGCACCGGGC CACCGAGCAG CTGCGTGCCC 
TCGTCGGGGC GGCCTGCCGA CTCCCCGGCG 

40 

TGCTGCACGC CGGTGCCGAC GCGGTCGGCC 
AGGGAAGGTA CTCGCCCGAC CCCGACACGC 
TCGTGCAGGG GGCCGACCGG TTCGACCCCG 

45 

TCACCATGGA CCCCCAGCAG CGGCTGCTGC 
CCGGTCTGGA CCCCCAGTCC CTGGCGGGCA 
50 AGAGCGGCTA CCAGAAGGGC GTCGAAGGGC 

CCGGCATCGT CAGCTTCACC GCCGGCCGCG 
CGCTGACGAT CGACACGGCC TGCTCCTCGT 

55 

CACTGCGCCG GGGCGAGTGC GACCTCGCAC 



GAACTCACCC GCGACCTCGA CCTGTCGGCG 30300 

CTGGGCAGCC CGGCCCAGGG CAACTACGCG 30360 

GCCCGGCGCC GGTCCCTCGG CCTCCCGTCG 30420 

ACCAGCCGGA TGGCACACGC ACTGGACCAG 3 0480 

GGCTTCCCGC CCCTGTCCGC CACGCTGGGC 3 0540 

GACGAGGCCG TGCAGGTCCC CATGCGGTTC 30600 

GTCCCCGCCC TGCTGTCGGA CCTCGTCGGG 30660 

GCGTCCGGCC CCCTTCCGGC TCCGGACGCC 30720 

TTGGCCGGAC TCTCCGCCGA GGAACGCCAC 30780 

GTGGCCGCGG TACTGGGCCA CGGCTCCGCC 30840 

GAGGTCGGGT TCGACTCGCT CACGGCCGTG 30900 

GGGGTCAGGC TCCCCGCCAC CCTGGTCTTC 30960 

CACCTCGACG GCCTGCTGGC CCCGGCACAG 31020 

CTGGACCGCA TCGAGGAGGC CCTGGCCGCC 31080 

CCCGCCCCCG ACGACCGGGC CGAGGTCGCC 31140 

CGCGCCCTGC ACGACGGCGC GCCCGGCGCC 31200 

GCCGACGACG ACGAGATCTT CGCGTTCATC 31260 

GCCCGGAGCC CCGCCCGTCA TCGAAAGGAA 31320 

GCGCCTACCT CAAGCGCGTG ACGGGTGAGC 31380 

TGGACCGGCG GGCCCACGAG CCGATCGCGA 31440 

GCGTCGAGAG TCCGGACGAC CTGTGGGAGC 31500 

CGGCCCCCGC CGACCGCGGC TGGGACGTGG 31560 

CCGGCACCTC GTACTGCCGC GAGQGCGGCT 31620 

CCCTCTTCGG CATCTCGCCC AACGAGGCGC 31680 

TGGAGACCTC CTGGGAGGCG CTGGAGCGAG 31740 

GCCGGACCGG CGTGTTCGCC GGGGCGTGGG 31800 

TCGAAGCCGA TCTGGAGGCC CAACTCCTGG 31860 

TCGCCTACGC CCTGGGCCTG GAGGGCCCGG 31920 

CGCTGGTGGC ACTGCACCTG GCGGTGCAGT 31980 

TGGCGGGCGG CGCCACGGTC ATCGCCGACT 32040 
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TCGCGCTCTT CACCCAGTTC TCCCGGCAGC GCGGGCTCGC CCCCGACGGG CGGTGCAAGG 32100 

CCTTCGGTCA GACGGCCGAC GGCTTCGGCC CCGCCGAGGG CGCGGGGATG CTGCTGGTCG 32160 

5 

AGCGGCTGTC GGACGCCCGC CGCAACGGGC ACCCGGTGCT GGCGGTGGTG CGGGGCAGTG 32220 

CCGTCAACCA GGACGGTGCG AGCAATGGGC TGACGGCGCC GAGTGGTCCT GCGCAGCAGC 32280 

10 GGGTGATCCG TGAGGCGCTG GCCGACGCGG GGCTGACGCC CGCCGACGTG GACGCGGTCG 32340 

AGGCGCACGG CACCGGCACG CCGCTCGGCG ACCCCATCGA GGCCGGCGCG CTCATGGCGA 32400 

CGTACGGGCA CGAACGGACG GGCGACCCGC TGTGGCTGGG TTCGCTGAAG TCGAACATCG 32460 

15 GGCACACCCA GGCCGCCGCC GGCGTGGCCG GGGTGATCAA GATGGTGCTG GCGCTGCGCC 32520 

ACGGTGAGCT GCCGCGCACC CTGCACGCGT CGACGGCGTC CTCCAGGATC GAATGGGACG 32580 

CGGGCGCCGT GGAGTTGCTG GACGAGGCCA GGCCCTGGCC CCGGCGTGCC GAGGGGCCGC 32640 

20 

GCCGGGCGGG CATCTCCTCG TTCGGCATCA GCGGCACCAA CGCGCACCTC GTCATCGAGG 32700 

AGGAGCCGCC CGCCCGGCCG GAGCCCGAGG AGGCCGCGCA GCCGCCCGCC CCGGCCACCA 32760 

CCGTCCTCCC GCTGTCGGCC GCCGGCGCGC GATCCCTGCG CGAGCAGGCC CGCAGGCTCG 32820 

25 

CCGCGCACCT GGCCGGCCAC GAGGAGATCA CCGCCGCCGA CGCCGCCCGC TCCGCCGCCA 32880 

CCACCCGTGC CGCGCTCTCG CACCGGGCCT CGGTCCTGGC CGACGACCGG CGGGCGCTGA 32940 

30 TCGACAGGCT GACCGCGCTG GCGGAGGACA GGAAGGACCC CGGCGTCACC GTCGGCGAGG 33000 

CGGGCAGCGG CCGGCCCCCC GTCTTCGTCT TCCCGGGACA GGGCTCCCAG TGGACGGGCA 33060 

TGGGCGCCGA ACTCCTGGAC AGGGCACCGG TCTTCCGCGC CAAGGCCGAG GAGTGCGCGC 33120 

35 GGGCCCTCGC GGCCCACCTC GACTGGTCGG TGCTCGACGT CCTGCGCGAC GCGCCCGGCG 33180 

CCCCGCCGAT CGACCGCGCG GACGTCGTCC AGCCGACCCT GTTCACCATG ATGGTCTCCC 33240 

TCGCGGCGCT GTGGGAGTCC CACGGTCTAC GGCCCGCCGC CGTGGTCGGC CACTCCCAAG 33300 

40 

GCGAGATCGC CGCCGCCCAC GCGGCCGGTG CCCTGTCCCT CGACGACGCG GCCCGCGTGA 333 60 

TCGCCGAGCG CAGCAGGCTC TGGAAGCGGC TGGCCGGAAA CGGCGGCATG CTCTCCGTGA 33420 

TGGCCCCGGC CGACCGGGTC CGCGAACTGA TGGAGCCCTG GGCGGAGCGG ATGTCCGTGG 33480 

45 

CCGCCGTCAA CGGCCCCGCC TCGGTCACCG TGGCCGGTGA CGCGCGGGCG CTGGAGGAGT 33540 

TCGGCGGCCG GCTCTCCGCC GCCGGGGTGC TGCGCTGGCC CCTCGCCGGC GTCGACTTCG 33 600 

50 CCGGACACTC ACCCCAGGTG GAGCAGTTCC GCGCCGAGCT CCTCGACACG CPGGGCACCG 33 660 

TCCGCCCGAC CGCCGCCCGG CTGCCCTTCT TCTCCACCGT GACCGCCGCG GCGCACGAGC 33720 

CCGAAGGCCT GGACGCCGCG TACTGGTACC GGAACATGCG CGAACCCGTG GAGTTCGCGT 33780 

55 

CCACCCTGCG GACGCTGCTG CGCGAGGGCC ACCGCACCTT CGTCGAGATG GGCCCGCACC 33840 
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CCCTGCTGGG CGCCGCGATC GACGAGGTCG 
TCGCCACCCT CCACCGCGGC TCCGGCGGCC 

5 

CGTTCGCCCA CGGAGTACGG GTCGACTGGG 
GGGTCCCGCT GCCCACCTAC GCCTTCAGCC 

1Q GCCGGCGCGC CGTCGAGGCG GCCCCCGTCG 

CCTGGACACC CGTGGCATCC GACGACTCCG 
AGACCCCCGG CACCGCGCCG GACGAGGCGG 

?5 GGGTGGTCGT GGAGCGCTGC CTGCTGGATC 

AGCGACTGGC CGAACTGGAC GCGCAGCCGG 
GCCGTCCGCA GAGCACCGCA CCGGCCGACG 

20 

TCCTGCTCGT GGTCCAGGCC GTGCCGGACG 
CGCGGGGTGC GGTGGCGGTG GGGTCGGGTG 
GGGGTCTGGG GCGGGTGGCT GCGTTGGAGG 

25 

TGGCGGTGGG GGCGGGTGTG CGTGAGTGGC 
GTGAGGATCA GGTGGCGGTG CGTGGTGGGG 

30 GGGTGCGGGG TGGTTCGGGG GTGTGGCGTG 

TGGGTGGTGT GGGGGGTCAT GTGGCGCGGT 
TGTTGGCGGG GCGTCGGGGT GGTGGGGTTG 

35 TGGGGTTGGG GGCGAAGGTG ACGTTCGTTT 

TGGGGTTGTT GGGTGTGGTG GAGGGGTTGG 
CGGGGGTGGC TCAGGTGTCG GGGTTGGGTG 

40 

TGGGGGGTAA GGCGGTGGGG GCTGAGTTGT 
ATGCGTTCGT GTTGTTCTCG TCGGGTGCTG 
ATGCGGCGGC CAATGCGCAT CTGGATGCGT 

45 

CCGCGACCTC CGTCGCCTGG GGCCTGTGGG 
TCACGGAGTT CTACGCCGAG CGCGGCCTCG 

50 CACTGCACAC GGCACTGAAC GAGGGCGACA 

AACACTTCGT CACCGGGTTC ACCGCCTACC 
AGGTCCGCGC GTTGCGCACG CCCGAACCCA 

55 GCGTCGACGC CGCCCTCACC CCGCGCGAGC 



CCGAGGCCGA GGGCGTGCAC GCCACCGCCC 33900 

TGGACCGGTT CCGCTCCTCG GTGGGCGCCG 33960 

ACGCCCTCTT CGAGGGCTCC GGCGCCCGCC 34020 

GGGACCGGTA CTGGCTGCCC ACCGCCATCG 34080 

ACGCGTCCGC CCCCGGGCGC TACCGCGTCA 34140 

GCCGGCCCTC CGGGCGCTGG CTGCTGGTGC 34200 

ACACCGCGGC GTCGGCCCTC GGTGCGGCCG 34260 

CCACCGAGGC CGCGCGCGTC ACGCTCACCG 34320 

AGGGCCTGGC CGGCGTGCTG GTGCTGCCCG 34380 

CCTCCCCGCT CGACCCGGGG ACGGCCGCCG 34440 

CCGCTCCGAA GGCCCGGATC TGGGTGGTGA 34500 

AGGTGCCGTG TGCGGTGGGT GCGCGGGTGT 34560 

TGCCGGTGCA GTGGGGTGGG TTGGTGGATG 34620 

GTCGTGTGGT GGGTGTGGTT GCGGGGGGTG 34680 

GTGTGTTCGG TCGTCGTCTG GTGGGTGTGG 34740 

CGCGGGGGTG TGTGGTGGTG ACGGGTGGGT 34800 

GGTTGGCGCG TTCGGGTGCG GAGCATGTGG 34860 

TGGGGGCGGT GGAGTTGGAG CGGGAGTTGG 34920 

CGTGTGATGT GGGGGATCGG GCGTCGATGG 34980 

GGGTGCCGTT GCGTGGTGTG TTTCATGCGG 35040 

AGGTGTCGTT GGCGGAGGCG GGTGGTGTGT 35100 

TGGACGAGTT GACGGCGGGT GTGGAGCTGG 35160 

GGGTGTGGGG GAGTGGGGGG CAGTCGGTGT 3 5220 

TGGCGGAGCG TCGTCGTGCG CAGGGGCGTC 3 5280 

GCGGCGAGGG CATGGGAGCG GACGAAGGCG 3 5340 

CCCCCATGCG GCCCGAGTCG GGCATCGAGG 3 5400 

CCTGCGTCAC GGTCGCCGAC ATCGACTGGG 35460 

GGCCCAGCCC GCTGATCTCC GACATCCCCC 35520 

CCGTGGACGC CTCGGACGGA CTGCGCCGGC 35580 

GCACCAAGGT CCTGGTCGAC CTGGTCCGCA 35640 
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CGGTGGCGGC GGAGGTCCTC GGTCACGACG 
TCCGGGACCT CGGCTTCGAC TCGCTGGCCG 

5 

CGACCGGACT CGTACTGCCC GCGACGGTCA 
GCGGCGCGCT GCTGGAGCGG CTGTCCGCGG 

10 CCGCCGGGGG GAGGCCCGCG ACCCCACCGC 

ACATCGACGA ACTCGACGCG GACGCCCTGA 
CCGACGGCAC GCCGGCCGAC GGCGGGCCCG 

75 CGGAGCAGTA GCGCGCCCTC ACCGGCGCGC 

CCCCACAGCC AGCGAGCAGA CGAGGAAGCC 
GGGTGCGCTG CGCACCTCCG TCAAGGAGAC 

20 

CCTGGCCGGC GCGCACGAGC CGGTCGCCAT 
CGTGAGCACC CCGGACGACC TGTGGGAGCT 
CTTCCCGGCC GACCGGGGCT GGGACGAGGA 

25 

CGGCACCACC TACTGCCGTG AGGGCGGCTT 
CTTCTTCGGC ATCTCGCCGA ACGAGGCGCT 

30 GGAGACGTCG TGGGAGACGT TGGAGCGGGC 

CCGTACCGGT GTCTTCGTCG GAGCCGCGCA 
GCCCGAGGGC ACCGAGGGCT ATCTGCTGAC 

55 GATCGCCTAC TCCCTGGGTC TGGAGGGGCC 

GTCGTTGGTG GCGTTGCATC TGGCGGTGCA 
GTTGGCCGGC GGCGTCGCGG TCATGCCCGA 

40 

GCGGGGGCTG GCGGTGGACG GGCGGTGCAA 
GTGGGCGGAG GGAGTGGGTG TGCTGCTGGT 

45 CCATCGGGTG CTGGCGGTGG TGCGGGGCAG 

GCTGACGGCG C CG AGTGGTC CTGCGCAGCA 
GGGGCTGACG CCCGCCGACG TGGATGTGGT 

50 TGATCCGATC GAGGCGGGTG CGTTGCTGGC 

TTTGTGGTTG GGGTCGTTGA AGTCGAACAT 
TGGTGTGATC AAGGTGGTGC AGGCGATGCG 

55 

GGATGCGCCG TCGTCGAAGG TGGAGTGGGC 



GGATCGGCGG CATCGGCCAC GACGTGGCCT 35700 

CGGTCCCGAT GCGCGGCCGG CTGGCCGAGG 35760 

TC1TCGACCA CCCCACCGTG GACCGGCTCG 35820 

ACGAACCCGC GCCCGGCGGG GCGCCGGAGC 35880 

CCGCACCGGA GCCGGCCGTC CACGACGCCG 35940 

TCCGGCTGGC CACGGGAACC GCCGGACCGG 3 6000 

ACGCGGCGGC GACCGCCCCC GACGGAGCAC 3 6060 

CGACCGGCGG AGCGCCGTAC CGCCGACGCC 36120 

GAAGATGTCA CCGTCCATGG ACGAAGTGCT 3 6180 

CGAGCGGCTG CGCCGGCACA ACCGGGAGCT 3 6240 

CGTGGGCATG GCCTGCCGCT ACCCCGGTGG 3 6300 

CGCCGCGGAC GGCGTCGACG CGATCACCCC 36360 

CGCCGTCTAC TCGCCCGACC CCGACACCCC 3 6420 

CCTCACCGGC GCCGGGGACT TCGACGCGGC 36480 

GGTGATGGAC CCGCAGCAGC GGCTGTTGCT 36540 

CGGCATCGTC CCCGCGTCGC TGCGCGGCAG 3 6600 

CACGGGATAC GTCACCGACA CCGCGCGAGC 3 6660 

GGGCAACGCC GATGCCGTCA TGTCCGGCCG 3 6720 

GGGGCTGACG ATCGGGACGG CCTGCTCGTC 36780 

GTCGTTGCGG CGGGGCGAGT GCGACCTGGC 3 6840 

CCCGACGGTG TTCGTGGAGT TCTCGCGGCA 36900 

GGCGTTCGCG GAGGGTGCGG ACGGGACGGC 3 6960 

GGAGCGGCTT TCCGACGCGC GCCGCAATGG 37020 

TGCGGTCAAT CAGGACGGGG CGAGCAATGG 37080 

GCGGGTGATC CGTGAGGCGC TGGCTGATGC 37140 

GGAGGCGCAC GGTACGGGGA CGGCGTTGGG 37200 

CACGTACGGG CGGGAGCGGG TCGGTGATCC 37260 

CGGGCATGCG CAGGCGGCTG CGGGTGTGGG 37320 

GCATGGGTCG TTGCCGCGGA CGCTGCATGT 37380 

TTCGGGTGCG GTGGAGCTGC TGACCGAGGG 37440 
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CCGGTCGTGG CCGCGGCGGG TGGAGCGGGT 
GAGCGGGACC AACGCCCATG TGGTCCTGGA 

5 

CGGGGACGGC CCCGGACCCG ACCGGCCCGA 
CTCGGCACGC TCGCGGGAGG CGCTG CGCGG 

10 CCAGGGGCGC ACGGAGGGCA CCGGCGGCGG 

CGGATACTCC CTGGCCACCA CCAGGGAGAC 
GGAGAACCGG ACGGCCGGGG AGGACCTCGC 

15 CGTGGTCACG GGTGTCGCGC GACGTGGCCG 

CGCCCAGCGG CTCGGCGCCG GTCGGGAGCT 
CCTCGACGAG ATC GCGGCGG AGTTCGACGC 

20 

GTTCGCCGAG CCCGCCACGC CGGACGCCGC 
GGCCCTCTTC GCGGTGGAGA CCGCGCTCTT 
GGACGTCCTC GTGGGCCACT CGATCGGCGG 

25 

CTCTGCGGCC GACGCGGCCC GGCTGGTCTC 
CGAGGGCGGC GCGATGGCGG CCGTGCAGGC 

30 CGTCGCCGCC GGCGGCGCGG TGGTCGCCGC 

CGGGGACGAG GCGGCCGTAC TGGCGGCGGC 
CAAGCGCCTG AGGGTGAGCC ACGCCTTCCA 

35 CTTCCGCGCG GTG GCGGACA CGGTCGACTA 

AGTGACCGGC GACCTCGCCG ACGCCGCCCA 
GGTGCGGCAG CCGGTGCGCT TCGCCGACGC 

40 

GACCTTCATC GAGCTCGGGC CCGACGCCGT 
CGCGGAGGCC GACGTCGTGT TCGCCCCGGC 
CGTGCTCCGG GCCGCCGCGA GCGCGTACGT 

45 

CTACGGCGGC ACGGGAGCCC GCCGCACCGA 
CTACTGGCTC GCCCCCGCCT CGGCCGCGGT 
50 ATCCGTGCCG GAAGCCGAGC AGGACGGGGC 

CGCCTCGGCC GCGGCGCGAC TGGGCGCCGA 
GGTCCTGCCG CACCTGGCCG CCTGGCACGA 

55 

CCTGCACTAC CGCGTCACCT GGCAGGCGAT 



GCGGCGGGCC GCGGTGTCGG CGTTCGGGGT 37500 

GGAAGCACCG GTCGAGGCCG GGAGCGAGCA 37560 

CGCCGTGACG GGTCCGCTCC CCTGGGTGCT 37620 

CCAGGCCGGA CGACTCGCCG CTCTCGCCCG 37680 

CAGCGGACTC GTCGTCCCCG CGGCCGACAT 37740 

CCTGGAGCAC CGGGCGGTGG CGCTGGTGCA 37800 

CGCGCTGGCC GCCGGCCGCA CACCGGAGAG 37860 

CGGGATCGCC TTCCTCTGCT CGGGGCAGGG 37920 

CCGCGGCAGG TTCCCCGTCT TCGCCGACGC 37980 

CCACCTCGAA CGCCCTCTCC TGTCGGTGAT 38040 

ACTCCTCGAC CGCACCGACT ACACCCAGCC 38100 

CCGGCTCCTG GAGAGCTGGG GCCTGGTCCC 38160 

TCTGGTGGCG GCTCACGTGG CGGGCGTCTT 38220 

CGCACGCGGC CGGCTCATGC GGGCCCTGCC 38280 

CACCGAGCGG GAGGCCGCCG CGCTGGAGCC 38340 

GGTCAACGGC CCGCAGGCCC TCGTGCTCTC 38400 

CGGTGAACTG GCCGCCCGCG GACGCCGCAC 38460 

CTCACCCCGT ATGGACGCCA TGCTCGCCGA 38520 

CCACGCCCCC CGGCTGCCGG TCGTCTCCGA 38580 

GCTGACCGAC CCCGGCTACT GGACCCGCCA 38640 

CGTGCGCACC GCGAGCGCCC GGGACGCCGC 38700 

CCTGTGCGGC ATGGCGGAGG AGTCCCTGGC 38760 

ACTGCGCCGC GGGCGCCCGG AGGGCGACAC 38820 

CCGCGGCGCG GGCCTCGACT GGGCCGCGCT 38880 

CCTGCCCACC TACGCCTTCC AGCACAGCCG 38940 

CGCCCCCGCG ACGGCCGCCC CCTCCGTCCG 39000 

GCTGTGGGCC GCCGTGCACG CCGGTGACGT 39060 

CGACGCCGGT ATCGAACACG AACTGCGCGC 39120 

CCGCGACCGC GCGACCGCGC GGACCGCGGG 39180 

CGAGGCAGAC GCTGTCAGGT TCAGCCCCTC 39240 
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GGATCGCTGG CTGATGGTCG AGCATGGGCA 
CCCGCTGCGC GCGGCCGGCG CGGAGGTCAC 

5 

CGGATCACCG CGGACGGAGA CCCCGGACCG 
CGCACGGAGC CCGGAGGGCC TGGCCGGCGT 

10 GGTCGCCGGG CACCCCGGGC TGGACCAGGG 

ACTGACCGAC GCCGCGGTGC GGGCACCGCT 
GGGGTCGGGT GAGGTGCCGT GTGCGGTGGG 

15 TGCGTTGGAG GTGCCGGTGC AGTGGGGTGG 

GCGTGAGTGG CGTCGTGTGG TGGGTGTGGT 
GCGTGGTCGG GGTGTGTTCG GTCGTCGTCT 

20 

GGTGTGGCGT GCGCGGGGGT GTGTGGTGGT 
TGTGGCGCGG TGGTTGGCGC GTTCGGGTGC 
TGGTGGGGTT GTGGGGGCGG TGGAGTTGGA 

25 

GACGTTCGTT TCGTGTGATG TGGGGGATCG 
GGAGGGGTTG GGGGTGCCGT TGCGTGGTGT 

30 GGGGTTGGGT GAGGTGTCGT TGGCGGAGGC 

GGCTGAGTTG TTGGACGAGT TGACGGCGGG 
GTCGGGTGCT GGGGTGTGGG GGAGTGGGGG 

35 TCTGGATGCG TTGGCGGAGC GTCGTCGTGC 

GGGCCCGTGG GACGGCGACG GCATGGGCGA 
CGGCGTGGCC CCGCTCCACC CCGAGACGGC 

40 

CGGCGAAGCC ACGGTCACCG TGGCGGACAT 
CGCCTTCCGT CCCAGCCCCC TGATCGCCGG 
CGGCCGGCCC GCCGAGGACA CCCCCACCGC 

45 

CCGGCCGCGG CTCGCCCTGG ACCTGGTGCT 
CGAGGACGCC CGGGTCGACG CCCGGGCCCC 
50 CGCGGTGCGG CTGCGCCGCC GGCTGGCCGA 

CGTCTTCGAC CACGAGGACC CCACCGCGCT 
GGGGACCCCC GGCCCCCAGG AGGGCACGGC 

55 

CCGCGCCGCC GTCGAACAGC GCAGGTCGAG 



GCACACGGAA TGCGCGGACG CCGCGGAACG 39300 

CCGCCTGGTG TGGCCGCTGG AGCAGCACAC 39360 

CGGCACCCTG GCGGCCCGGC TGGCCGAGCT 39420 

GCTGCTGCTC CCCGACTCGG GCGGTGCCGC 39480 

AACGGCGGCG GTGCTGCTGA CGATCCAGGC 39540 

GTGGGTGGTG ACGCGGGGTG CGGTGGCGGT 39600 

TGCGCGGGTG TGGGGTCTGG GGCGGGTGGC 39660 

GTTGGTGGAT GTGGCGGTGG GGGCGGGTGT 39720 

TGCGGGGGGT GGTGAGGATC AGGTGGCGGT 39780 

GGTGGGTGTG GGGGTGCGGG GTGGTTCGGG 39840 

GACGGGTGGG TTGGGTGGTG TGGGGGGTCA 39900 

GGAGCATGTG GTGTTGGCGG GGCGTCGGGG 39960 

GCGGGAGTTG GTGGGGTTGG GGGCGAAGGT 40020 

GGCGTCGGTG GTGGGGTTGT TGGGTGTGGT 40080 

GTTTCATGCG GCGGGGGTGG CTCAGGTGTC 40140 

GGGTGGTGTG TTGGGGGGTA AGGCGGTGGG 40200 

TGTGGAGCTG GATGCGTTCG TGTTGTTCTC 40260 

GCAGTCGGTG TATGCGGCGG CCAATGCGCA 40320 

GCAGGGGCGT CCCGCGACCT CCGTCGCCTG 40380 

GATGGCGCCC GAGGGCTACT TCGCCCGCCA 40440 

GCTCACCGCC CTGCACCAGG CCATCGACGG 40500 

CGACTGGGAA CGGTTCGCCC CCGGCTTCAC 40560 

CATCCCCGCG GCCCGTACGG CGCCCGCCGC 40620 

CCCCGGCCTC CTGCGGGCGC GGCCCGAGGA 40680 

CCGCCACGTC GCGGCGGTCC TCGGCCACTC 40740 

CTTCCGGGAC CTCGGCTTCG ACTCGCTCGC 40800 

GGACACCGGG CTCGACCTGC CCGGCACCCT 40860 

GGCCCACCAC CTGGCCGGCC TCGCCGACGC 40920 

TCGGGCCGAG AGCGGGCTGT TCGCCTCCTT 40980 

CGAGGTCGTG GAGCTGATGG CCGACCTGGC 41040 



132 



EP 0 791 655 A2 



GGCGTTCCGG CCCGCCTACT CCCGGCAGCA CCCCGGCTCC GGCCGCCCCG CGCCCGTACC 41100 

CCTCGCGACC GGACCGGCGA CGCGTCCCAC GCTGTACTGC TGCGCCGGCA CCGCGGTCGG 41160 

5 

CTCCGGGCCC GCCGAGTACG TCCCG1TCGC CGAAGGACTG CGCGGCGTCC GGGAGACGGT 41220 

CGCCCTTCCC CTGTCCGGCT TCGGCGACCC CGCGGAACCG ATGCCCGCAT CGCTCGACGC 41280 

10 GCTGATCGAG GTCCAGGCCG ACGTCCTCCT GGAGCACACC GCGGGCAAGC CCTTCGCCCT 41340 

CGCCGGCCAC TCCGCCGGCG CGAACATCGC CCACGCCCTG GCCGCCCGGC TGGAGGAACG 41400 

CGGCTCGGGC CCCGCAGCCG TCGTACTGAT GGACGTCTAC CGTCCCGAGG ACCCCGGTGC 41460 

15 GATGGGCGAG TGGCGCGACG ACCTGCTCAG CTGGGCGCTC GAACGCAGCA CGGTGCCCCT 41520 

GGAGGACCAC CGGCTCACCG CCATGGCCGG CTATCAGCGG CTGGTGCTCG GAACCCGGCT 41580 

CACCGCCCTC GAAGCCCCCG TCCTGCTGGC CCGGGCGTCC GAACCCCTGT GCGCGTGGCC 41640 

20 

GCCCGCGGGC GGGGCGCGGG GCGACTGGCG GTCCCAGGTC CCGTTCGCAC GGACCGTCGC 417 00 

CGACGTGCCC GGCAACCACT TCACCATGCT CACCGAACAC GCCCGGCACA CCGCGTCCCT 417 60 

GGTGCACGAA TGGCTGGACA GCCTCCCGCA CCAGCCCGGT CCCGCCCCGC TCACCGGAGG 41820 

25 

GAAACACTGA TGTACGCCGA CGACATCGCG GCCGTCTACG ACCTGGTCCA CGAGGGGAAG 41880 

GGGAAGGACT ACCGGCAGGA GGCCGAGGAG ATCGCCGCAC TCGTGCGCGT CCACCGGCCG 41940 

30 GGCGCCCGGA CCCTGCTCGA CGTGGCCTGC GGCACCGGCC AGCACCTGCA CCACCTGGAC 42000 

GGCCTCTTCG ACCACGTCGA GGGCCTGGAA CTCTCCGCCG ACATGCTGGC CCTCGCGACC 42060 

GGCCGGAACC CCGGTGTCAC CTTCCACCAA GGGGACATGC GCTCGTTCTC CCTGGGACGC 42120 

55 CGGTTCGACG CGGTGACCTG CATGTTCAGC TCCATAGGCC ACCTGCGGAC CACCGACGAA 42180 

CTCGACAGCA CGCTGCGGGC CTTCACCGAC CACCTCGAAC CGTCCGGCGT CATCGTCGTC 42240 

GAACCCTGGT GGTTCCCCGA GTCCTTCACC CCCGGTTACG TCGGCGCCAG CATCACGGAG 42300 

40 

GC GGGCGAGC GCACCGTCTG CCGGGTCTCG CACTCCGTAC GGGAGGGGAA CGCCACCCGC 42360 

ATCGAGGTGC ACTACCTCCT CGCCGGACCC GGCGGCGTCC GTCACCTGAC CGAGGACCAC 42420 

4s ACCATCACCC TGTTCCCGCG CGCCGACTAC GAGGCGGCCT TCGAGCGCGC CGGCTGCGAC 42480 

GTGGTCTACC AGGAAGGCGG CCCGTCCGGT CGCGGGCTGT TCATCGGCAC CCGCCGCTGA 42540 

CCCGGTGCCG ACGCGGACCG CCGCGGCCCG GAGGCGGGTT GCCCCGACCC ACCCGGCACA 42600 

50 CCCGGGTCCC CCGATCGTGC GAGCGCCCCC ATCGACCCGA GAAGAAAGGC AGGGCAGCCA 42660 

TGCCCACCCT TGCCACGGAA ACGGCCCCCG CGAGCACGAG CACGAGCGCG GGCACGAGCA 42720 

CGGGCGTCCG TGCGCTCGGC CGTCGGCTCC AGCTGACCCG GGCCGCACAC TGGTGCGCCG 42780 

55 

GCAACCAGGG CGACCCGTAC GCGCTGATCC TGCGCGCCGT CGCCGACCCC GAGCCGTTCG 42840 
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AACGGGAGAT CCGGGCCCGC GGACCGTGGT TCCGCAGCGA ACAGCTGGAC GCCTGGGTGA 42900 

CCGCGGACCC CGAGGTGGCG GCGGCCGTCC TGGCCGACCC GCGCTTCGGC ACGCTGGACC 42960 

GGGCCGGACG CCGCCCGGAC GAGGAACTGC TGCCCCTCGC CGAGGCGTTC CCCCACCACG 43020 

AACGCGCGGA GCTCGTACGC CTGCGGGCGC TGGCCGCCCC GGTGCTCAGC CGGTACGCCC 43080 

CGGCCCAGGC GCCCTGCGCG GCGCGCACCA CCGCCCGCAG AGTGCTCGGC CGCCTGCTGC 43140 

CCACCGGTGA CGCCGGGTTC GACCTTGTCG GCGAGGTCGC CCGGCCCTAC GCCGTCGAGC 43200 

TGATGCTCAG GCTCCTCGGA GTGCCGGGCC GCGACCGCGC CACCGCCGCG CGGGCACTCG 43260 

CCGCCTGCGG CCCCCAGCTC GACGCCCGGA TGGCCCCGCA ACTGCTGACC GTGGCCCGGG 43320 

AGTCCGCCGA CGCCGTCCGC ACACTGGCCG ACCTGGTCCC CGAGCTCGTC GCGGAGAAGT 43380 

CCCGGGGCCT CGGGAACGCC GAGCCCCGGC CCGACGACGT GCTCGCCCTC CTCCTGCACG 43440 

ACGGCGTCGC CCCCGGCGAC GTCGAGCGCA TCGCGCTGCT CCTCGCGGTC GGCGCACCCG 43500 

AACCCGTCGT CACCGCCGTC GCGCACACGG TCCACCGGCT GCTCGGCCGG CCGGGGGAGT 43560 

GGGAGAGGGC CCGCCGGACG CCGGCCGCGG CGAACGCCGT CGACCAGGTG CTGCGCGAGC 43620 

GCCCCCCGGC CCGGCTGGAG AACCGGGTCG CGCACACCGG CCTCGAACTC GGCGGCCGCC 43 680 

GGATCACCGC CGACGAGCAC GTCGTGGTGC TGGCCGCCGC CGGACGGGAG ATCCCCGGGC 43740 

CGGAGCCGCT CGGGGGCGCC GACGGACCGC ACCTGGCGCT CGCCCTCCCG CTGATCCGCC 43800 

TGGCCGCCAC CACCGCGGTC CAGGTCACGG CCGGCCGCCT GCCCGGCCTG CGGGCCGAGG 43860 

GACCGCCCCT GACCCGGCCG CGGTCACCGG TCCTGGGCGC CTGCGCCCGC CTCCGGGTCC 43920 

ACCCGGGATG ACCCCGCCGT CCGTACGCCC CCTCCCAGAC CGGAGCCGCT GTGCGCGTCC 43980 

TGCTGACATC CCTCGCCCAC AACACCCACT ACTACAGTCT GGTGCCCCTC GCCTGGGCGC 44040 

TGCGCGCCGC CGGGCACGAG GTACGGGTGG CGAGCCCGCC CTCCCTCACC GACGTCATCA 44100 

CCTCCACCGG TCTGACCGCC GTACCGGTGG GCGACGACCG ACCGGCCGCG GAGCTGCTCG 44160 

CCGAGATGGG CAGAGACCTC GTCCCCTACC AGAGGGGCTT CGAGTTCGGT GAGGTGGAGA 44220 

3GCGAGGAGGA GACCACCTGG GAGTACCTGC TCGGCCAGCA GAGCATGATG GCCGCCCTCT 44280 

GCTTCGCCCC GTTCAACGGC GCCGCCACGA TGGACGAGAT CGTCGACTTC GCCCGTGGCT 44340 

GGCGGCCCGA CCTGGTCGTG TGGGAACCCT GGACCTA 44377 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4550 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Ser Gly Glu Leu Ala lie Ser Arg Ser Asp Asp Arg Ser Asp Ala 
15 10 15 

Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He 
20 25 30 

Ala Glu Phe Trp Lys Leu Leu Thr Asp Gly Arg Asp Ala He Gly Arg 
35 40 45 

Asp Ala Asp Gly Arg Arg Arg Gly Met He Glu Ala Pro Gly Asp Phe 
50 55 60 

Asp Ala Ala Phe Phe Gly Met Ser Pro Arg Glu Ala Ala Glu Thr Asp 
65 70 75 80 

Pro Gin Gin Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp 
85 90 95 

Ala Gly He Val Pro Gly Ser Leu Arg Gly Glu Ala Val Gly Val Phe 
100 105 110 

Val Gly Ala Met His Asp Asp Tyr Ala Thr Leu Leu His Arg Ala Gly 
115 120 125 

Ala Pro Val Gly Pro His Thr Ala Thr Gly Leu Gin Arg Ala Met Leu 

130 135 140 

Ala Asn Arg Leu Ser Tyr Val Leu Gly Thr Arg Gly Pro Ser Leu Ala 

145 150 155 160 

Val Asp Thr Ala Gin Ser Ser Ser Leu Val Ala Val Ala Leu Ala Val 
165 170 175 

Glu Ser Leu Arg Ala Gly Thr Ser Arg Val Ala Val Ala Gly Gly Val 
180 185 190 

Asn Leu Val Leu Ala Asp Glu Gly Thr Ala Ala Met Glu Arg Leu Gly 
195 200 205 

Ala Leu Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn 
210 215 220 

Gly Tyr Val Arg Gly Glu Gly Gly Ala Ala Val Val Leu Lys Pro Leu 
225 230 235 240 

Ala Asp Ala Leu Ala Asp Gly Asp Pro Val Tyr Cys Val Val Arg Gly 
245 250 255 

Val Ala Val Gly Asn Asp Gly Gly Gly Pro Gly Leu Thr Ala Pro Asp 
260 265 270 

Arg Glu Gly Gin Glu Ala Val Leu Arg Ala Ala Cys Ala Gin Ala Arg 
275 280 285 

Val Asp Pro Ala Glu Val Arg Phe Val Glu Leu His Gly Thr Gly Thr 
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290 



295 



300 



Pro Val Gly Asp Pro Val Glu Ala His Ala Leu Gly Ala Val His Gly 
305 310 315 320 

Ser Gly Arg Pro Ala Asp Asp Pro Leu Leu Val Gly Ser Val Lys Thr 
325 330 335 

Asn lie Gly His Leu Glu Gly Ala Ala Gly He Ala Gly Leu Val Lys 
340 345 350 

Ala Ala Leu Cys Leu Arg Glu Arg Thr Leu Pro Gly Ser Leu Asn Phe 
355 360 365 

Ala Thr Pro Ser Pro Ala He Pro Leu Asp Gin Leu Arg Leu Lys Val 
370 375 380 

Gin Thr Ala Ala Ala Glu Leu Pro Leu Ala Pro Gly Gly Ala Pro Leu 
385 390 395 400 

Leu Ala Gly Val Ser Ser Phe Gly He Gly Gly Thr Asn Cys His Val 
405 410 415 

Val Leu Glu His Leu Pro Ser Arg Pro Thr Pro Ala Val Ser Val Ala 
420 425 430 

Ala Ser Leu Pro Asp Val Pro Pro Leu Leu Leu Ser Ala Arg Ser Glu 
435 440 445 

Gly Ala Leu Arg Ala Gin Ala Val Arg Leu Gly Glu Tyr Val Glu Arg 
450 455 460 

Val Gly Ala Asp Pro Arg Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg 
465 470 475 480 

Thr Leu Phe Glu His Arg Ala Val Val Pro Cys Gly Gly Arg Gly Glu 

485 490 495 

Leu Val Ala Ala Leu Gly Gly Phe Ala Ala Gly Arg Val Ser Gly Gly 
500 505 510 

Val Arg Ser Gly Arg Ala Val Pro Gly Gly Val Gly Val Leu Phe Thr 
515 520 525 

Gly Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly 
530 535 540 

Gly Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Met Val Gly 
545 550 555 560 

Glu Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Gly Asp Val Asp 
565 570 575 

Val Asp Ala Gly Ala Gly Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly 
580 585 590 

Val Gly Ser Gly Ser Gly Ser Val Gly Gly Leu Leu Gly Arg Thr Glu 
595 600 605 



Phe Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala 
610 615 620 
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Leu Glu Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val 
625 630 635 640 

Gly Glu Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser Leu Gly Asp 
645 650 655 

Ala Val Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly Gly Leu Pro 
660 665 670 

Val Gly Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser Val Val Arg 
675 680 685 

Gly Val Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala Ala Val Asn 
690 695 700 

Gly Pro Arg Ser Val Val Leu Ser Gly Asp Val Gly Val Leu Glu Ser 
705 710 715 720 

Val Val Ala Ser Leu Met Gly Asp Gly Val Glu Cys Arg Arg Leu Asp 
725 730 735 

Val Ser His Gly Phe His Ser Val Leu Met Glu Pro Val Leu Gly Glu 
740 745 750 

Phe Arg Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val Arg Pro Gly 
755 760 765 

Val Val Val Val Ser Gly Val Ser Gly Gly Val Val Gly Ser Gly Glu 
770 775 780 

Leu Gly Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu Ala Val Arg 
785 790 795 800 

Phe Ala Asp Gly Val Gly Val Val Arg Gly Leu Gly Val Gly Thr Leu 
805 810 815 

Val Glu Val Gly Pro His Gly Val Leu Thr Gly Met Ala Gly Glu Cys 
820 825 830 

Leu Gly Ala Gly Asp Asp Val Val Val Val Pro Ala Met Arg Arg Gly 

835 840 845 

Arg Ala Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr Val Phe Thr 
850 855 860 

Arg Asp Ala Gly Leu Asp Ala Thr Ala Leu His Thr Gly Ser Thr Gly 
865 870 875 880 

Arg Arg lie Asp Leu Pro Thr Tyr Pro Phe Gin Arg Arg Thr His Trp 
885 890 895 

Ser Pro Ala Leu Ser Arg Pro Val Thr Ala Asp Ala Gly Ala Gly Val 
900 905 910 

Thr Ala Thr Asp Ala Val Gly His Ser Val Ser Pro Asp Pro Glu Ser 
915 920 925 



Thr Glu Gly Thr Ser His Arg Asp Thr Asp Asp Glu Ala Asp Ser Ala 
930 935 940 
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Ser Pro Glu Pro Met Ser Pro Glu Asp Ala Val Arg Leu Val Arg Glu 
945 950 955 960 

Ser Thr Ala Ala Val Leu Gly His Asp Asp Pro Gly Glu Val Ala Leu 
965 970 975 

Asp Arg Thr Phe Thr Ser Gin Gly Met Asp Ser Val Thr Ala Val Glu 
980 985 990 

Leu Cys Asp Leu Leu Lys Gly Ala Ser Gly Leu Pro Leu Ala Ala Thr 
995 1000 1005 

Leu Val Tyr Asp Leu Pro Thr Pro Arg Ala Val Ala Glu His lie Val 
1010 1015 1020 

Glu Ala Ala Gly Gly Pro Lys Asp Ser Val Ala Gly Gly Pro Gly Val 
1025 1030 1035 1040 

Leu Ser Ser Ala Ala Val Gly Val Ser Asp Ala Arg Gly Gly Ser Arg 
1045 1050 1055 

Asp Asp Asp Asp Pro lie Ala lie Val Gly Val Gly Cys Arg Leu Pro 
1060 1065 1070 

Gly Gly Val Asp Ser Arg Ala Ala Leu Trp Glu Leu Leu Glu Ser Gly 
1075 1080 1085 

Ala Asp Ala lie Ser Ser Phe Pro Thr Asp Arg Gly Trp Asp Leu Asp 
1090 1095 1100 

Gly Leu Tyr Asp Pro Glu Pro Gly Thr Pro Gly Lys Thr Tyr Val Arg 
1105 1110 1115 1120 

Glu Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe 
1125 1130 1135 

Gly He Ser Pro Arg Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu 
1140 1145 1150 

Leu Leu Glu Ala Ser Trp Glu Ala Leu Glu Asp Ala Gly Val Leu Pro 
1155 1160 1165 

Glu Ser Leu Arg Gly Gly Asp Ala Gly Val Phe Val Gly Ala Thr Ala 
1170 1175 1180 

Pro Glu Tyr Gly Pro Arg Leu His Glu Gly Ala Asp Gly Tyr Glu Gly 
1185 1190 1195 1200 

Tyr Leu Leu Thr Gly Thr Thr Ala Ser Val Ala Ser Gly Arg He Ala 
1205 1210 1215 

Tyr Thr Leu Gly Thr Gly Gly Pro Ala Leu Thr Val Asp Thr Ala Cys 
1220 1225 1230 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ala Leu Arg Arg 
1235 1240 1245 

Gly Glu Cys Gly Leu Ala Leu Ala Gly Gly Ala Thr Val Met Ser Gly 
1250 1255 1260 

Pro Gly Met Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp 
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1265 1270 1275 1280 

Gly Arg Cys Met Pro Phe Ser Ala Asp Ala Asp Gly Thr Ala Trp Ser 
1285 1290 1295 

Glu Gly Val Ala Val Leu Ala Leu Glu Arg Leu Ser Asp Ala Arg Arg 
1300 1305 1310 

Ala Gly His Arg Val Leu Gly Val Val Arg Gly Ser Ala Val Asn Gin 
1315 1320 1325 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Asn Arg Ser Ala Gin Glu 
1330 1335 1340 

Gly Val He Arg Ala Ala Leu Ala Asp Ala Gly Leu Ala Pro Gly Asp 
1345 1350 1355 1360 

Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro 
1365 1370 1375 

lie Glu Ala Ser Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly 
1380 1385 1390 

Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Val Gly His Thr Gin 
1395 1400 1405 

Ala Ala Ala Gly Ala Ala Gly Val Val Lys Met Leu Leu Ala Leu Glu 
1410 1415 1420 

His Gly Thr Leu Pro Arg Thr Leu His Ala Asp Arg Pro Ser Thr His 
1425 1430 1435 1440 

Val Asp Trp Ser Ser Gly Thr Val Ala Leu Leu Ala Glu Ala Arg Arg 
1445 1450 1455 

Trp Pro Arg Arg Ser Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe 
1460 1465 1470 

Gly He Ser Gly Thr Asn Ala His Leu He He Glu Glu Ala Pro Glu 
1475 1480 1485 

Trp Val Glu Asp He Asp Gly Val Ala Ala Pro Asp Arg Gly Thr Ala 
1490 1495 1500 

Asp Ala Ala Ala Pro Ser Pro Leu Leu Leu Ser Ala Arg Ser Glu Gly 
1505 1510 1515 1520 

Ala Leu Arg Ala Gin Ala Val Arg Leu Gly Glu Tyr Val Glu Arg Val 
1525 1530 1535 

Gly Ala Asp Pro Arg Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg Thr 

1540 1545 1550 

Leu Phe Glu His Arg Ala Val Val Pro Cys Gly Gly Arg Gly Glu Leu 
1555 1560 1565 

Val Ala Ala Leu Gly Gly Phe Ala Ala Gly Arg Val Ser Gly Gly Val 
1570 1575 1580 

Arg Ser Gly Arg Ala Val Pro Gly Gly Val Gly Val Leu Phe Thr Gly 
1585 1590 1595 1600 
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Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly 
1605 1610 1615 

Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Met Val Gly Glu 
1620 1625 1630 

Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Gly Asp Val Asp Val 
1635 1640 1645 

Asp Ala Gly Ala Gly Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly Val 
1650 1655 1660 

Gly Ser Gly Ser Gly Ser Val Gly Gly Leu Leu Gly Arg Thr Glu Phe 
1665 1670 1675 1680 

Ala Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu 
1685 1690 1695 

Glu Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val Gly 
1700 1705 1710 

Glu Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala 
1715 1720 1725 

Val Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val 
1730 1735 1740 

Gly Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly 
1745 1750 1755 1760 

Val Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly 
1765 1770 1775 

Pro Arg Ser Val Val Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val 
1780 1785 1790 

Val Ala Ser Leu Met Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val 
1795 1800 1805 

Ser His Gly Phe His Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe 
1810 1815 1820 

Arg Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val 
1825 1830 1835 1840 

Val Val Val Ser Gly Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu 
1845 1850 1855 

Gly Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu Ala Val Arg Phe 
1860 1865 1870 

Ala Asp Gly Val Gly Val Val Arg Gly Leu Gly Val Gly Thr Leu Val 
1875 1880 1885 

Glu Val Gly Pro His Gly Val Leu Thr Gly Met Ala Gly Glu Cys Leu 

1890 1895 1900 

Gly Ala Gly Asp Asp Val Val Val Val Pro Ala Met Arg Arg Gly Arg 

1905 1910 1915 1920 
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Ala Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg 

1925 1930 1935 

Asp Ala Gly Leu Asp Ala Thr Ala Leu His Thr Gly Ser Thr Gly Arg 

1940 1945 1950 

Arg He Asp Leu Pro Thr Tyr Pro Phe Gin Arg Asp Arg Tyr Trp Leu 

1955 i960 1965 

Asp Pro Val Arg Thr Ala Val Thr Gly Val Glu Pro Ala Gly Ser Pro 

1970 1975 1980 

Ala Asp Ala Arg Ala Thr Glu Arg Gly Arg Ser Thr Thr Ala Gly He 

1985 1990 1995 2000 

Arg Tyr Arg Val Ala Trp Gin Pro Ala Val Val Asp Arg Gly Asn Pro 

2005 2010 2015 

Gly Pro Ala Gly His Val Leu Leu Leu Ala Pro Asp Glu Asp Thr Ala 

2020 2025 2030 

Asp Ser Gly Leu Ala Pro Ala He Ala Arg Glu Leu Ala Val Arg Gly 

2035 2040 2045 

Ala Glu Val His Thr Val Ala Val Pro Val Gly Thr Gly Arg Glu Ala 

2050 2055 2060 

Ala Gly Asp Leu Leu Arg Ala Ala Gly Asp Gly Ala Ala Arg Ser Thr 

2065 2070 2075 2080 

Arg Val Leu Trp Leu Ala Pro Ala Glu Pro Asp Ala Ala Asp Ala Val 

2085 2090 2095 

Ala Leu Val Gin Ala Leu Gly Glu Ala Val Pro Glu Ala Pro Leu Trp 

2100 2105 2110 

He Thr Thr Arg Glu Ala Ala Ala Val Arg Pro Asp Glu Thr Pro Ser 

2115 2120 2125 

Val Gly Gly Ala Gin Leu Trp Gly Leu Gly Gin Val Ala Ala Leu Glu 

2130 2135 2140 

Leu Gly Arg Arg Trp Gly Gly Leu Ala Asp Leu Pro Gly Ser Ala Ser 

2145 2150 2155 2160 

Pro Ala Val Leu Arg Thr Phe Val Gly Ala Leu Leu Ala Gly Gly Glu 

2165 2170 2175 

Asn Gin Phe Ala Val Arg Pro Ser Gly Val His Val Arg Arg Val Val 

2180 2185 2190 

Pro Ala Pro Val Pro Val Pro Ala Ser Ala Arg Thr Val Thr Thr Ala 

2195 2200 2205 

Pro Ala Thr Ala Val Gly Glu Asp Ala Arg Asn Asp Thr Ser Asp Val 

2210 2215 2220 

Val Val Pro Asp Asp Arg Trp Ser Ser Gly Thr Val Leu He Thr Gly 

2225 2230 2235 2240 
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Gly Thr Gly Ala Leu Gly Ala Gin Val Ala Arg Arg Leu Ala Arg Ser 

2245 2250 2255 

Gly Ala Ala Arg Leu Leu Leu Val Gly Arg Arg Gly Ala Ala Gly Pro 
2260 2265 2270 

Gly Val Gly Glu Leu Val Glu Glu Leu Thr Ala Leu Gly Ser Glu Val 
2275 2280 2285 

Ala Val Glu Ala Cys Asp Val Ala Asp Arg Asp Ala Leu Ala Ala Leu 
2290 2295 2300 

Leu Ala Gly Leu Pro Glu Glu Arg Pro Leu Val Ala Val Leu His Ala 
2305 2310 2315 2320 

Ala Gly Val Leu Asp Asp Gly Val Leu Asp Ser Leu Thr Ser Asp Arg 
2325 2330 2335 

Val Asp Ala Val Leu Arg Asp Lys Val Thr Ala Ala Arg His Leu Asp 
2340 2345 2350 

Glu Leu Thr Ala Asp Leu Pro Leu Asp Ala Phe Val Leu Phe Ser Ser 
2355 2360 2365 

He Val Gly Val Trp Gly Asn Gly Gly Gin Ala Val Tyr Ala Ala Ala 
2370 2375 2380 

Asn Ala Ala Leu Asp Ala Leu Ala Gin Arg Arg Arg Ala Arg Gly Ala 
2385 2390 2395 2400 

Arg Ala Ala Ser He Ala Trp Gly Pro Trp Ala Gly Ala Gly Met Ala 
2405 2410 2415 

Ser Gly Thr Ala Ala Lys Ser Phe Glu Arg Asp Gly Val Thr Ala Leu 
2420 2425 2430 

Asp Pro Glu Arg Ala Leu Asp Val Leu Asp Asp Val Val Gly Ala Gly 
2435 2440 2445 

Gly Thr Ser Ala Ala Gly Thr His Ala Ala Gly Glu Ser Ser Leu Leu 
2450 2455 2460 

Val Ala Asp Val Asp Trp Glu Thr Phe Val Gly Arg Ser Val Thr Arg 
2465 2470 2475 2480 

Arg Thr Trp Ser Leu Phe Asp Gly Val Ser Ala Ala Arg Ser Ala Arg 
2485 2490 2495 

Ala Gly His Ala Ala Asp Asp Arg Ala Ala Leu Thr Pro Gly Thr Arg 
2500 2505 2510 

Pro Gly Asp Gly Ala Pro Gly Gly Ser Gly Gin Asp Gly Gly Glu Gly 
2515 2520 2525 

Arg Pro Trp Leu Ser Val Gly Pro Ser Pro Ala Glu Arg Arg Arg Ala 
2530 2535 2540 

Leu Leu Thr Leu Val Arg Ser Glu Ala Ala Gly He Leu Arg His Ala 
2545 2550 2555 2560 

Ser Ala Asp Ala Val Asp Pro Glu Leu Ala Phe Arg Ser Ala Gly Phe 
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2565 2570 2575 

Asp Ser Leu Thr Val Leu Glu Leu Arg Asn Arg Leu Thr Ala Ala Thr 
2580 2585 2590 

Cly Lou Asn Leu Pro Asn Thr Leu Leu Phe Asp His Pro Thr Pro Leu 

2595 2600 2605 

Ser Leu Ala Ser His Leu His Asp Glu Leu Phe Gly Pro Asp Ser Glu 
2610 2615 2620 

Ala Glu Pro Ala Ala Ala Ala Pro Thr Pro Val Met Ala Asp Glu Arg 
2625 2630 2635 2640 

Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Val 
2645 2650 2655 

Ala Ser Pro Asp Asp Leu Trp Asp Leu Val Ala Gly Asp Gly His Thr 
2660 2665 2670 

Leu Ser Pro Phe Pro Ala Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr 
2675 2680 2685 

Asp Pro Glu Pro Gly Val Pro Gly Lys Ser Tyr Val Arg Glu Gly Gly 
2690 2695 2700 

Phe Leu Arg Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser 
2705 2710 2715 2720 

Pro Arg Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu 
2725 2730 2735 

Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro Asp Ser Leu 
2740 2745 2750 

Arg Gly Thr Arg Thr Gly Val Phe Ser Gly He Ser Gin Gin Asp Tyr 
2755 2760 2765 

Ala Thr Gin Leu Gly Asp Ala Ala Asp Thr Tyr Gly Gly His Val Leu 
2770 2775 2780 

Thr Gly Thr Leu Gly Ser Val He Ser Gly Arg Val Ala Tyr Ala Leu 
2785 2790 2795 2800 

Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser 
2805 2810 2815 

Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys 
2820 2825 2830 

Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro Thr Val 
2835 2840 2845 

Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys 
2850 2855 2860 

Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Val 
2865 2870 2875 2880 

Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
2885 2890 2895 
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Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
2900 2905 2910 

Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val lie 
2915 2920 2925 

Arg Glu Ala Leu Ala Asp Ala Gly Leu Val Pro Ala Asp Val Asp Val 
2930 2935 2940 

Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala 
2945 2950 2955 2960 

Gly Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu 
2965 2970 2975 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala Ala Ala 
2930 2985 2990 

Gly Val Gly Gly Val He Lys Val Val Gin Gly Met Arg His Gly Ser 
2995 3000 3005 

Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys Val Glu Trp 
3010 3015 3020 

Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Thr Arg Ser Trp Pro Arg 
3025 3030 3035 3040 

Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser 
3045 3050 3055 

Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala Glu Ala Gly 
3060 3065 3070 

Ser Glu His Gly Asp Gly Pro Glu Pro Glu Arg Pro Asp Ala Val Thr 
3075 3080 3085 

Gly Pro Leu Ser Trp Val Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg 
3090 3095 3100 

Ala Gin Ala Val Arg Leu Arg Glu Cys Val Glu Arg Val Gly Ala Asp 
3105 3110 3115 3120 

Pro Arg Asp Val Ala Gly Ser Leu Val Val Ser Arg Ala Ser Phe Gly 
3125 3130 3135 

Glu Arg Ala Val Val Val Gly Arg Gly Arg Glu Glu Leu Leu Ala Gly 
3140 3145 3150 

Leu Asp Val Val Ala Ala Gly Ala Pro Val Gly Val Ser Ser Gly Ala 
3155 3160 3165 

Gly Ala Val Val Arg Gly Ser Ala Val Arg Gly Arg Gly Val Gly Val 
3170 3175 3180 

Leu Phe Thr Gly Gin Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu 
3185 3190 3195 3200 

Tyr Ala Gly Gly Gly Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser 
3205 3210 3215 
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Val Val Gly Glu Val Asp Gly Arg Ser Leu Arg Asp Val Met Phe Ala 
3220 3225 3230 

Asp Ala Asp Ser Val Leu Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala 
3235 3240 3245 

Gin Pro Ala Leu Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu 
3250 3255 3260 

Ala Arg Gly Val Glu Val Ser Val Val Leu Gly His Ser Val Gly Glu 
3265 3270 3275 3280 

Val Ala Ala Ala Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val 
3285 3290 3295 

Arg Leu Val Val Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly 

3300 3305 3310 

Gly Gly Met Trp Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly Val 

3315 3320 3325 

Val Glu Gly Leu Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro 
3330 3335 3340 

Arg Ser Val Val Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val Val 
3345 3350 3355 3360 

Val Thr Leu Met Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser 
3365 3370 3375 

His Gly Phe His Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg 
3380 3385 3390 

Gly Val Val Glu Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val 
3395 3400 3405 

Val Val Ser Gly Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly 
3410 3415 3420 

Asp Pro Gly Tyr Trp Val Arg His Ala Arg Glu Ala Val Arg Phe Ala 
3425 3430 3435 3440 

Asp Gly Val Gly Val Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu 
3445 3450 3455 

Val Gly Pro His Gly Val Leu Thr Gly Met Ala Gly Gin Cys Leu Glu 
3460 3465 3470 

Ala Gly Asp Asp Val Val Val Val Pro Ala Met Arg Arg Gly Arg Pro 
3475 3480 3485 

Glu Arg Glu Val Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp 
3490 3495 3500 

Ala Gly Leu Asp Ala Thr Thr Leu His Thr Gly Ser Thr Gly Arg Arg 
3505 3510 3515 3520 

lie Asp Leu Pro Thr Tyr Pro Phe Gin His Asn Arg Tyr Trp Ala Thr 
3525 3530 3535 

Gly Ser Val Thr Gly Ala Thr Gly Thr Ser Ala Ala Ala Arg Phe Gly 
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3540 3545 3550 

Leu Glu Trp Lys Asp His Pro Phe Leu Ser Gly Ala Thr Pro lie Ala 
3555 3560 3565 

Gly Ser Gly Ala Leu Leu Leu Thr Gly Arg Val Gly Leu Ala Ala His 
3570 3575 3580 

Pro Trp Leu Ala Asp His Ala lie Ser Gly Thr Val Leu Leu Pro Gly 
3585 3590 3595 3600 

Thr Ala He Ala Asp Leu Leu Leu Arg Ala Val Glu Glu Val Gly Ala 
3605 3610 3615 

Gly Gly Val Glu Glu Leu Thr Leu His Glu Pro Leu Leu Leu Pro Glu 

3620 3625 3630 

Arg Gly Gly Leu His Val Gin Val Leu Val Glu Ala Ala Asp Glu Gin 
3635 3640 3645 

Gly Arg Arg Ala Val Ala Val Ala Ala Arg Pro Glu Gly Pro Gly Arg 

3650 3655 3660 

Asp Gly Glu Glu Gin Glu Trp Thr Arg His Ala Glu Gly Val Leu Thr 
3665 3670 3675 3680 

Ser Thr Glu Thr Ala Val Pro Asp Met Gly Trp Ala Ala Gly Ala Trp 
3685 3690 3695 

Pro Pro Pro Gly Ala Glu Pro He Asp Val Glu Glu Leu Tyr Asp Ala 

3700 3705 3710 

Phe Ala Ala Asp Gly Tyr Gly Tyr Gly Pro Ala Phe Thr Ala Leu Ser 
3715 3720 3725 

Gly Val Trp Arg Leu Gly Asp Glu Leu Phe Ala Glu Val Arg Arg Pro 
3730 3735 3740 

Ala Gly Gly Ala Gly Thr Thr Gly Asp Gly Phe Gly Val His Pro Ala 
3745 3750 3755 3760 

Leu Phe Asp Ala Ala Leu His Pro Trp Arg Ala Gly Gly Leu Leu Pro 
3765 3770 3775 

Asp Thr Gly Gly Thr Thr Trp Ala Pro Phe Ser Trp Gin Gly lie Ala 

3780 3785 3790 

Leu His Thr Thr Gly Ala Glu Thr Leu Arg Val Arg Leu Ala Pro Ala 
3795 3800 3805 

Ala Gly Gly Thr Glu Ser Ala Phe Ser Val Gin Ala Ala Asp Pro Ala 
3810 3815 3820 

Gly Thr Pro Val Leu Thr Leu Asp Ala Leu Leu Leu Arg Pro Val Thr 
3825 3830 3835 3840 

Leu Gly Arg Ala Asp Ala Pro Gin Pro Leu Tyr Arg Val Asp Trp Gin 
3845 3850 3855 

Pro Val Gly Gin Gly Thr Glu Ala Ser Gly Ala Gin Gly Trp Thr Val 

3860 3865 3870 
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Leu Gly Gin Ala Ala Ala Glu Thr Val Ala Gin Pro Ala Ala His Ala 
3875 3880 3885 

Asp Leu Thr Ala Leu Arg Thr Ala Val Ala Ala Ala Gly Thr Pro Val 
3890 3895 3900 

Pro Arg Leu Val Val Val Ser Pro Val Asp Thr Arg Leu Asp Glu Gly 
3905 3910 3915 3920 

Pro Val Leu Ala Asp Ala Glu Ala Arg Ala Arg Ala Gly Asp Gly Trp 
3925 3930 3935 

Asp Asp Asp Pro Leu Arg Val Ala Leu Gly Arg Gly Leu Thr Leu Val 
3940 3945 3950 

Arg Glu Trp Val Glu Asp Glu Arg Leu Ala Asp Ser Arg Leu Val Val 
3955 3960 3965 

Leu Thr Arg Gly Ala Val Ala Ala Gly Pro Gly Asp Val Pro Asp Leu 
3970 3975 3980 

Thr Gly Ala Ala Leu Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Tyr 
3985 3990 3995 4000 

Pro Asp Arg Phe Thr Leu He Asp Val Asp Asp Ser Pro Glu Ser Arg 

4005 4010 4015 

Ala Ala Leu Pro Arg Ala Leu Gly Ser Ala Glu Arg Gin Leu Ala Leu 
4020 4025 4030 

Arg Thr Gly Asp Val Leu Ala Pro Ala Leu Val Pro Met Ala Thr Arg 
4035 4040 4045 

Pro Ala Glu Thr Thr Pro Ala Thr Ala Val Ala Ser Ala Thr Thr Gin 
4050 4055 4060 

Thr Gin Val Thr Ala Pro Ala Pro Asp Asp Pro Ala Ala Asp Ala Val 
4065 4070 4075 4080 

Phe Asp Pro Ala Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu 
4085 4090 4095 

Gly Arg Arg Val Ala Ser His Leu Ala Arg Arg Tyr Gly Val Arg His 
4100 4105 4110 

Met Leu Leu Val Ser Arg Arg Gly Pro Asp Ala Pro Glu Ala Gly Pro 
4115 4120 4125 

Leu Glu Arg Glu Leu Ala Gly Leu Gly Val Thr Ala Thr Phe Leu Ala 
4130 4135 4140 

Cys Asp Leu Thr Asp He Glu Ala Val Arg Lys Ala Val Ala Ala Val 
4145 4150 4155 4160 

Pro Ser Asp His Pro Leu Thr Gly Val Val His Thr Ala Gly Val Leu 
4165 4170 4175 

Asp Asp Gly Ala Leu Thr Gly Leu Thr Arg Gin Arg Leu Asp Thr Val 
4180 4185 4190 
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Leu Arg Pro Lys Ala Asp Ala Val Arg Asn Leu His Glu Ala Thr Leu 
4195 4200 4205 

Asp Arg Pro Leu Arg Ala Phe Val Leu Phe Ser Ala Ala Ala Gly Leu 
4210 4215 4220 

Leu Gly Arg Pro Gly Gin Ala Ser Tyr Ala Ala Ala Asn Ala Val Leu 
4225 4230 4235 4240 

Asp Ala Leu Ala Gly Ala Arg Arg Ala Ala Gly Leu Pro Ala Val Ser 
4245 4250 4255 

Leu Ala Trp Gly Leu Trp Asp Glu Gin Thr Gly Met Ala Gly Gly Leu 
4260 4265 4270 

Asp Glu Met Ala Leu Arg Val Leu Arg Arg Asp Gly He Ala Ala Met 
4275 4280 4285 

Pro Pro Glu Gin Gly Leu Glu Leu Leu Asp Leu Ala Leu Thr Gly His 
4290 4295 4300 

Arg Asp Gly Pro Ala Val Leu Val Pro Leu Leu Leu Asp Gly Ala Ala 
4305 4310 4315 4320 

Leu Arg Arg Thr Ala Lys Glu Arg Gly Ala Ala Thr Met Ser Pro Leu 
4325 4330 4335 

Leu Arg Ala Leu Leu Pro Ala Ala Leu Arg Arg Ser Gly Gly Ala Gly 
4340 4345 4350 

Ala Pro Ala Ala Ala Asp Arg His Gly Lys Glu Ala Asp Pro Gly Ala 

4355 4360 4365 

Gly Arg Leu Ala Gly Met Val Ala Leu Glu Ala Ala Glu Arg Ser Ala 
4370 4375 4380 

Ala Val Leu Glu Leu Val Thr Glu Gin Val Ala Glu Val Leu Gly Tyr 
4385 4390 4395 4400 

Ala Ser Ala Ala Glu He Glu Pro Glu Arg Pro Phe Arg Glu He Gly 
4405 4410 4415 

Val Asp Ser Leu Ala Ala Val Glu Leu Arg Asn Arg Leu Ser Arg Leu 
4420 4425 4430 

Val Gly Leu Arg Leu Pro Thr Thr Leu Ser Phe Asp His Pro Thr Pro 
4435 4440 4445 

Lys Asp Met Ala Gin His He Asp Gly Gin Leu Pro Arg Pro Ala Gly 
4450 4455 4460 

Ala Ser Pro Ala Asp Ala Ala Leu Glu Gly He Gly Asp Leu Ala Arg 
4465 4470 4475 4480 

Ala Val Ala Leu Leu Gly Thr Gly Asp Ala Arg Arg Ala Glu Val Arg 
4485 4490 4495 

Glu Gin Leu Val Gly Leu Leu Ala Ala Leu Asp Pro Pro Gly Arg Thr 
4500 4505 4510 

Gly Thr Ala Ala Pro Gly Val Pro Ser Gly Ala Asp Gly Ala Glu Pro 
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4515 4520 4525 

Thr Val Thr Asp Arg Leu Asp Glu Ala Thr Asp Asp Glu He Phe Ala 
4530 4535 4540 

Phe Leu Asp Glu Gin Leu 
4545 4550 
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(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1996 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 



Met Thr Ala Glu Asn Asp Lys He Arg Ser Tyr Leu Lys Arg Ala Thr 
15 10 15 

Ala Glu Leu His Arg Thr Lys Ser Arg Leu Ala Glu Val Glu Ser Ala 
20 25 30 

Ser Arg Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro Gly 
35 40 45 

Gly Val Ala Ser Pro Asp Asp Leu Trp Asp Leu Val Ala Ala Gly Thr 
50 55 60 

Asp Ala Val Ser Ala Phe Pro Val Asp Arg Gly Trp Asp Val Glu Gly 
65 70 75 80 

Leu Tyr Asp Pro Asp Pro Glu Ala Val Gly Arg Ser Tyr Val Arg Glu 
85 90 95 

Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly 
100 105 110 

He Ser Pro Arg Glu Ala Ala Ala Met Asp Pro Gin Gin Arg Leu Leu 
115 120 125 

Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly He Val Pro Ala 
130 135 140 

Ser Leu Arg Gly Thr Arg Thr Gly Val Phe Thr Gly Val Met Tyr Asp 
145 150 155 160 

Asp Tyr Gly Ser Arg Phe Asp Ser Ala Pro Pro Glu Tyr Glu Gly Tyr 
165 170 175 

50 Leu Val Asn Gly Ser Ala Gly Ser He Ala Ser Gly Arg Val Ala Tyr 

180 185 190 

Ala Leu Gly Leu Glu Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser 
195 200 205 



55 



Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly 
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210 



215 



220 



Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Thr Val Met Ala Thr Pro 
225 230 235 240 

Thr Val Leu Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp Gly 
245 250 255 

Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu 
260 265 270 

Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn 
275 280 285 

Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp 
290 295 300 

Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg 
305 310 315 320 

Val He Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val 
325 330 335 

Asp Ala Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro He 
340 345 350 

Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Ser Glu Arg Gin Gly Gin 
355 360 365 

Gly Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin 
370 375 380 

Ala Ala Ala Gly Val Gly Gly Val He Lys Val Val Gin Ala Met Arg 
385 390 395 400 

His Gly Ser Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys 
405 410 415 

Val Glu Trp Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Thr Arg Ser 

420 425 430 

Trp Pro Arg Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 

Gly Val Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Ala 
450 455 460 

Glu Ala Gly Ser Glu His Gly Asp Gly Pro Glu Pro Glu Arg Pro Asp 
465 470 475 480 

Ala Val Thr Gly Pro Leu Ser Trp Val Leu Ser Ala Arg Ser Glu Gly 
485 490 495 

Ala Leu Arg Ala Gin Ala Val Arg Leu Arg Glu Cys Val Glu Arg Val 
500 505 510 

Gly Ala Asp Pro Arg Asp Val Ala Gly Ser Leu Val Val Ser Arg Ala 
515 520 525 



Ser Phe Gly Glu Arg Ala Val Val Val Gly Arg Gly Arg Glu Glu Leu 
530 535 540 



150 



EP 0 791 655 A2 



Leu Ala Gly Leu Asp Val Val Ala Ala 
545 550 



Gly Ala Pro Val Gly Val Ser 
555 560 



Gly Gly Val Ser Ser Gly Ala Gly Ala 
565 



Val Val Arg Gly Ser Ala Val 
570 575 



Arg Gly Arg Gly Val Gly Val Leu Phe 
580 585 



Thr Gly Gin Gly Ala Gin Trp 
590 



Val Gly Met Gly Arg Gly Leu Tyr Ala 
595 600 



Gly Gly Gly Val Phe Ala Glu 
605 



Val Leu Asp Glu Val Leu Ser Val Val 
610 615 



Gly Glu Val Gly Gly Trp Ser 
620 



Leu Arg Asp Val Met Phe Gly Asp Val 
625 630 



Asp Val Asp Ala Gly Ala Gly 
635 640 



Ala Asp Ala Gly Val Gly Ser Gly Val 
645 



Gly Val Gly Gly Leu Leu Gly 
650 655 



Arg Thr Glu Phe Ala Gin Pro Ala Leu 
660 665 



Phe Ala Leu Glu Val Ala Leu 
670 



Phe Arg Ala Leu Glu Ala Arg Gly Val 
675 680 



Glu Val Ser Val Val Leu Gly 
685 



His Ser Val Gly Glu Val Ala Ala Ala 
690 695 



Tyr Val Ala Gly Val Leu Ser 
700 



Leu Gly Asp Ala Val Arg Leu Val Val 
705 710 



Ala Arg Gly Gly Leu Met Gly 
715 720 



Gly Leu Pro Val Gly Gly Gly Met Trp 
725 



Ser Val Gly Ala Ser Glu Ser 
730 735 



Val Val Arg Gly Val Val Glu Gly Leu 
740 745 



Gly Glu Trp Val Ser Val Ala 
750 



Ala Val Asn Gly Pro Arg Ser Val Val 

755 760 



Leu Ser Gly Asp Val Gly Val 
765 



Leu Glu Ser Val Val Ala Ser Leu Met 

770 775 
Arg Leu Asp Val Ser His Gly Phe His 
785 790 



Gly Asp Gly Val Glu Cys Arg 
780 

Ser Val Leu Met Glu Pro Val 
795 800 



Leu Gly Glu Phe Arg Gly Val Val 
805 



Arg Pro Gly Val Val Val Val Ser 
820 



Ser Gly Glu Leu Gly Asp Pro Gly 
835 840 



Ala Val Arg Phe Ala Asp Gly Val 
850 855 



Glu Ser Leu Glu Phe Gly Arg Val 
810 815 

Ser Val Ser Gly Gly Val Val Gly 
825 830 

Tyr Trp Val Arg His Ala Arg Glu 
845 

Gly Val Val Arg Gly Leu Gly Val 
860 
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Gly Thr Leu Val Glu Val Gly Pro 
865 870 

Gly Glu Cys Leu Gly Ala Gly Asp 
885 

Arg Arg Gly Arg Ala Glu Arg Glu 
900 

Val Phe Thr Arg Asp Ala Gly Leu 
915 920 



His Gly Val Leu Thr Gly Met Ala 
875 880 

Asp Val Val Val Val Pro Ala Met 
890 895 

Val Phe Glu Ala Ala Leu Ala Thr 
905 910 

Asp Ala Thr Thr Leu His Thr Gly 
925 



Ser Thr Gly Arg Arg lie Asp Leu Pro Thr Tyr Pro Phe Gin His Asp 
930 935 940 

Arg Tyr Trp Leu Ala Ala Pro Ser Arg Pro Arg Thr Asp Gly Leu Ser 
945 950 955 960 

Ala Ala Gly Leu Arg Glu Val Glu His Pro Leu Leu Thr Ala Ala Val 
965 970 975 

Glu Leu Pro Gly Thr Asp Thr Glu Val Trp Thr Gly Arg He Ser Ala 
980 985 990 

Ala Asp Leu Pro Trp Leu Ala Asp His Leu Val Trp Asp Arg Gly Val 
995 1000 1005 



Val Pro Gly Thr Ala Leu Leu Glu Thr Val Leu Gin Val Gly Ser Arg 
1010 1015 1020 

He Gly Leu Pro Arg Val Ala Glu Leu Val Leu Glu Thr Pro Leu Thr 
1025 1030 1035 1040 

Trp Thr Ser Asp Arg Pro Leu Gin Val Arg He Val Val Thr Ala Ala 
1045 1050 1055 

Ala Thr Ala Pro Gly Gly Ala Arg Glu Leu Thr Leu His Ser Arg Pro 

1060 1065 1070 

Glu Pro Val Ala Ala Ser Ser Ser Ser Pro Ser Pro Ala Ser Pro Arg 
1075 1080 1085 

His Leu Thr Ala Gin Glu Ser Asp Asp Asp Trp Thr Arg His Ala Ser 
1090 1095 1100 

Gly Leu Leu Ala Pro Ala Ala Gly Leu Ala Asp Asp Phe Ala Glu Leu 
1105 1110 1115 1120 

Thr Gly Ala Trp Pro Pro Val Gly Ala Glu Pro Leu Asp Leu Ala Gly 

1125 1130 1135 

Gin Tyr Pro Leu Phe Ala Ala Ala Gly Val Arg Tyr Glu Gly Ala Phe 

1140 1145 1150 



Arg Gly Leu Arg Ala Ala Trp Arg Arg Gly Asp Glu Val Phe Ala Asp 
1155 1160 1165 

Val Arg Leu Pro Asp Ala His Ala Val Asp Ala Asp Arg Tyr Gly Val 
1170 1175 1180 



His Pro Ala Leu Leu Asp Ala Val Leu His Pro He Ala Ser Leu Asp 
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1185 1190 1195 1200 

Pro Leu Gly Asp Gly Gly His Gly Leu Leu Pro Phe Ser Trp Thr Asp 
1205 1210 1215 

Val Gin Gly His Gly Ala Gly Gly His Ala Leu Arg Val Arg Val Ala 
1220 1225 1230 

Ala Val Asp Gly Gly Ala Val Ser Val Thr Ala Ala Asp His Ala Gly 
1235 1240 1245 

Asn Pro Val Leu Ser Ala Arg Ser Leu Ala Leu Arg Arg lie Thr Ala 
1250 1255 1260 

Asp Arg Leu Pro Ala Ala Pro Val Ala Pro Leu Tyr Arg Val Asp Trp 
1265 1270 1275 1280 

Leu Pro Phe Pro Gly Pro Val Pro Val Ser Ala Gly Gly Arg Trp Ala 
1285 1290 1295 

Val Val Gly Pro Glu Ala Glu Ala Thr Ala Ala Gly Leu Arg Ala Val 
1300 1305 1310 

Gly Leu Asp Val Arg Thr His Ala Leu Pro Leu Gly Glu Pro Leu Pro 
1315 1320 1325 

Pro Gin Ala Gly Thr Asp Ala Glu Val lie lie Leu Asp Leu Thr Thr 
1330 1335 1340 

Thr Ala Ala Gly Arg Thr Ala Ser Asp Gly Gly Arg Leu Ser Leu Leu 
1345 1350 1355 1360 

Asp Glu Val Arg Ala Thr Val Arg Arg Thr Leu Glu Ala Val Gin Ala 
1365 1370 1375 

Arg Leu Ala Asp Thr Glu Thr Ala Pro Asp Val Asp Val Arg Thr Ala 
1380 1385 1390 

Ala Arg Pro Arg Thr Ala Ala Arg Thr Ser Pro Arg Val Asp Thr Arg 
1395 1400 1405 

Thr Gly Ala Arg Thr Ala Asp Gly Pro Arg Leu Val Val Leu Thr Arg 
1410 1415 1420 

Gly Ala Ala Gly Pro Glu Gly Gly Ala Ala Asp Pro Ala Gly Ala Ala 
1425 1430 1435 1440 

Val Trp Gly Leu Val Arg Val Ala Gin Ala Glu Gin Pro Gly Arg Phe 
1445 1450 1455 

Thr Leu Val Asp Val Asp Gly Thr Gin Ala Ser Leu Arg Ala Leu Pro 
1460 1465 1470 

Gly Leu Leu Ala Thr Asp Ala Gly Gin Ser Ala Val Arg Asp Gly Arg 

1475 1480 1485 

Val Thr Val Pro Arg Leu Val Pro Val Ala Asp Pro Val Pro His Gly 
1490 1495 1500 

Gly Gly Thr Ala Ala Asp Gly Thr Gly Ala Gly Glu Pro Ser Ala Thr 
1505 1510 1515 1520 
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Leu Asp Pro Glu Gly Thr Val Leu lie Thr Gly Gly Thr Gly Ala Leu 
1525 1530 1535 

Ala Ala Glu Thr Ala Arg His Leu Val Asp Arg His Lys Val Arg His 
1540 1545 1550 

Leu Leu Leu Val Gly Arg Arg Gly Pro Asp Ala Pro Gly Val Asp Arg 
1555 1560 1565 

Leu Val Ala Glu Leu Thr Glu Ser Gly Ala Glu Val Ala Val Arg Ala 
1570 1575 1580 

Cys Asp Val Thr Asp Arg Asp Ala Leu Arg Arg Leu Leu Asp Ala Leu 
1585 1590 1595 1600 

Pro Asp Glu His Pro Leu Thr Cys Val Val His Thr Ala Gly Val Leu 
1605 1610 1615 

Asp Asp Gly Val Leu Ser Ala Gin Thr Ala Glu Arg lie Asp Thr Val 
1620 1625 1630 

Leu Arg Pro Lys Ala Asp Ala Ala Val His Leu Asp Glu Leu Thr Arg 
1635 1640 1645 

Glu lie Gly Arg Val Pro Leu Val Leu Tyr Ser Ser Val Ser Ala Thr 
1650 1655 1660 

Leu Gly Ser Ala Gly Gin Ala Gly Tyr Ala Ala Ala Asn Ala Phe Met 
1665 1670 1675 1680 

Asp Ala Leu Ala Ala Arg Arg Cys Ala Ala Gly His Pro Ala Leu Ser 
1685 1690 1695 

Leu Gly Trp Gly Trp Trp Ser Gly Val Gly Leu Ala Thr Gly Leu Asp 
1700 1705 1710 

Gly Ala Asp Ala Ala Arg Val Arg Arg Ser Gly Leu Ala Pro Leu Asp 
1715 1720 1725 

Ala Gly Ala Ala Leu Asp Leu Leu Asp Arg Ala Leu Thr Arg Pro Glu 
1730 1735 1740 

Pro Ala Leu Leu Pro Val Arg Leu Asp Leu Arg Ala Ala Ala Gly Ala 
1745 1750 1755 1760 

Thr Ala Leu Pro Glu Val Leu Arg Asp Leu Ala Gly Val Pro Ala Asp 
1765 1770 1775 

Ala Arg Ser Thr Pro Gly Ala Ala Ala Gly Thr Gly Asp Glu Asp Gly 
1780 1785 1790 

Ala Val Arg Pro Ala Pro Ala Pro Ala Asp Ala Ala Gly Thr Leu Ala 
1795 1800 1805 

Ala Arg Leu Ala Gly Arg Ser Ala Pro Glu Arg Thr Ala Leu Leu Leu 
1810 1815 1820 

Asp Leu Val Arg Thr Glu Val Ala Ala Val Leu Gly His Gly Asp Pro 
1825 1830 1835 1840 
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Ala Ala He Gly Ala Ala Arg Thr Phe Lys Asp Ala Gly Phe Asp Ser 
1845 1850 1855 

Leu Thr Ala Val Asp Leu Arg Asn Arg Leu Asn Thr Arg Thr Gly Leu 
I860 1865 1870 

Arg Leu Pro Ala Thr Leu Val Phe Asp His Pro Thr Pro Leu Ala Leu 

1875 1880 1885 

Ala Glu Leu Leu Leu Asp Gly Leu Glu Ala Ala Gly Pro Ala Glu Pro 
1890 1895 1900 

Ala Ala Glu Val Pro Asp Glu Ala Ala Gly Ala Glu Thr Leu Ser Gly 
1905 1910 1915 1920 

Val He Asp Arg Leu Glu Arg Ser Leu Ala Ala Thr Asp Asp Gly Asp 
1925 1930 1935 

Ala Arg Val Arg Ala Ala Arg Arg Leu Arg Gly Leu Leu Asp Ala Leu 
1940 1945 1950 

Pro Ala Gly Pro Gly Ala Ala Ser Gly Pro Asp Ala Gly Glu His Ala 

1955 1960 1965 

Pro Gly Arg Gly Asp Val Val He Asp Arg Leu Arg Ser Ala Ser Asp 
1970 1975 1980 

Asp Asp Leu Phe Asp Leu Leu Asp Ser Asp Phe Gin 
1985 1990 1995 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3724 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Ala Thr Asn Glu Glu Lys Leu Arg Glu Tyr Leu Arg Arg Ala 
15 10 15 

Met Ala Asp Leu His Ser Ala Arg Glu Arg Leu Arg Glu Val Glu Ser 
20 25 30 

Ala Ser Arg Glu Pro He Ala He Val Gly Met Ala Cys Arg Tyr Pro 
35 40 45 

Gly Gly Val Ala Ser Pro Glu Glu Leu Trp Asp Leu Val Ala Ala Gly 
50 55 60 

Thr Asp Ala He Ser Pro Phe Pro Val Asp Arg Gly Trp Asp Ala Glu 
65 70 75 80 

Gly Leu Tyr Asp Pro Glu Pro Gly Val Pro Gly Lys Ser Tyr Val Arg 
85 90 95 
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Glu Gly Gly Phe Leu His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe 

100 105 110 

Gly lie Ser Pro Arg Glu Ala Ala Ala Met Asp Pro Gin Gin Arg Leu 
115 120 125 

Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu Arg Ala Gly lie Val Pro 
130 135 140 

Ala Ser Leu Arg Gly Thr Arg Thr Gly Val Phe Thr Gly Val Met Tyr 
145 150 155 160 

His Asp Tyr Gly Ser His Gin Val Gly Thr Ala Ala Asp Pro Ser Gly 
165 170 175 

Gin Leu Gly Leu Gly Thr Ala Gly Ser Val Ala Ser Gly Arg Val Ala 
180 185 190 

Tyr Thr Leu Gly Leu Gin Gly Pro Ala Val Thr Met Asp Thr Ala Cys 
195 200 205 

Ser Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg 
210 215 220 

Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly Ala Thr Val Leu Ala Thr 
225 230 235 240 

Pro Thr Val Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Ala Asp 
245 250 255 

Gly Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala 
260 265 270 

Glu Gly Ala Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg 
275 280 285 

Asn Gly His Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin 
290 295 300 

Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin 
305 310 315 320 

Arg Val lie Arg Asp Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp 

325 330 335 

Val Asp Ala Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro 
340 345 350 

He Glu Ala Gly Ala Leu Met Ala Thr Tyr Gly Ser Glu Arg Val Gly 
355 360 365 

Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr Gin 
370 375 380 

Ala Ala Ala Gly Ala Ala Gly Val He Lys Met Val Gin Ala Leu Arg 
385 390 395 400 



Gin Ser Glu Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ala Lys 
405 410 415 
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Val Glu Trp Asp Ala Gly Ala Val Gin Leu Leu Thr Gly Val Arg Pro 
420 425 430 

Trp Pro Arg Arg Glu His Arg Pro Arg Arg Ala Ala Val Ser Ala Phe 
435 440 445 

Gly Val Ser Gly Thr Asn Ala His Val lie He Glu Glu Pro Pro Ala 

450 455 460 

Ala Gly Asp Thr Ser Pro Ala Gly Asp Thr Pro Glu Pro Gly Glu Ala 
465 470 475 480 

Thr Ala Ser Pro Ser Thr Ala Ala Gly Pro Ser Ser Pro Ser Ala Val 
485 490 495 

Ala Gly Pro Leu Ser Pro Ser Ser Pro Ala Val Val Trp Pro Leu Ser 
500 505 510 

Ala Glu Thr Ala Pro Ala Leu Arg Ala Gin Ala Ala Arg Leu Arg Ala 
515 520 525 

His Leu Glu Arg Leu Pro Gly Thr Ser Pro Thr Asp He Gly His Ala 
530 535 540 

Leu Ala Ala Glu Arg Ala Ala Leu Thr Arg Arg Val Val Leu Leu Gly 
545 550 555 560 

Asp Asp Gly Ala Pro Val Asp Ala Leu Ala Ala Leu Ala Ala Gly Glu 
565 570 575 

Thr Thr Pro Asp Ala Val His Gly Thr Ala Ala Asp He Arg Arg Val 
580 585 590 

Ala Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Ala Gly Met Gly Ala 
595 600 605 

Glu Leu Leu Asp Thr Ala Pro Ala Phe Ala Ala Glu Leu Asp Arg Cys 
610 615 620 

Gin Gly Ala Leu Ser Pro Tyr Val Asp Trp Asn Leu Ala Asp Val Leu 
625 630 635 640 

Arg Gly Ala Pro Ala Ala Pro Gly Leu Asp Arg Val Asp Val Val Gin 
645 650 655 

Pro Ala Thr Phe Ala Val Met Val Gly Leu Ala Ala Leu Trp Arg Ser 
660 665 670 

Leu Gly Val Glu Pro Ala Ala Val He Gly His Ser Gin Gly Glu He 
675 680 685 

Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg 
690 695 700 

He Val Ala Leu Arg Ser Gin Val He Ala Arg Glu Leu Ala Gly Arg 
705 710 715 720 

Gly Gly Met Ala Ser Val Ala Leu Pro Ala Ala Glu Val Glu Ala Arg 
725 730 735 



Leu Ala Gly Gly Val Glu He Ala Ala Val Asn Gly Pro Gly Ser Thr 
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740 745 750 

Val Val Cys Gly Glu Pro Gly Ala Leu Glu Ala Leu Leu Val Thr Leu 
755 760 765 

Glu Ser Glu Gly Thr Arg Val Arg Arg He Asp Val Asp Tyr Ala Ser 
770 775 780 

His Ser His Tyr Val Glu Ser He Arg Ala Glu Leu Ala Thr Val Leu 
785 790 795 800 

Gly Pro Val Arg Pro Arg Arg Gly Asp Val Pro Phe Tyr Ser Thr Val 

805 810 815 

Glu Ala Ala Leu Leu Asp Thr Ala Thr Leu Asp Ala Asp Tyr Trp Tyr 
820 825 830 

Arg Asn Leu Arg Leu Pro Val Arg Phe Glu Pro Thr Val Arg Ala Met 
835 840 845 

Leu Asp Asp Gly Val Asp Ala Phe Val Glu Cys Ser Ala His Pro Val 
850 855 860 

Leu Thr Val Gly Val Arg Gin Thr Val Glu Ser Ala Gly Gly Ala Val 
865 870 875 880 

Pro Ala Leu Ala Ser Leu Arg Arg Asp Glu Gly Gly Leu Arg Arg Phe 
885 890 895 

Leu Thr Ser Ala Ala Glu Ala Gin Val Val Gly Val Pro Val Asp Trp 
900 905 910 

Ala Thr Leu Arg Pro Gly Ala Gly Arg Val Asp Leu Pro Thr Tyr Ala 
915 920 925 

Phe Gin Arg Glu Arg His Trp Val Gly Pro Ala Arg Pro Asp Ser Ala 
930 935 940 

Ala Thr Ala Ala Thr Thr Gly Asp Asp Ala Pro Glu Pro Gly Asp Arg 
945 950 955 960 

Leu Gly Tyr His Val Ala Trp Lys Gly Leu Arg Ser Thr Thr Gly Gly 
965 970 975 

Trp Arg Pro Gly Leu Arg Leu Leu He Val Pro Thr Gly Asp Gin Tyr 
980 985 990 

Thr Ala Leu Ala Asp Thr Leu Glu Gin Ala Val Ala Ser Phe Gly Gly 
995 1000 1005 

Thr Val Arg Arg Val Ala Phe Asp Pro Ala Arg Thr Gly Arg Ala Glu 
1010 1015 1020 

Leu Phe Gly Leu Leu Glu Thr Glu He Asn Gly Asp Thr Ala Val Thr 
1025 1030 1035 1040 

Gly Val Val Ser Leu Leu Gly Leu Cys Thr Asp Gly Arg Pro Asp His 
1045 1050 1055 

Pro Ala Val Pro Val Ala Val Thr Ala Thr Leu Ala Leu Val Gin Ala 
1060 1065 1070 
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Leu Ala Asp Leu Gly Ser Thr Ala Pro Leu Trp Thr Val Thr Cys Gly 
1075 1080 1085 



Ala Val Ala 


Thr 


Ala 


Pro Asp Glu 


Leu 


Pro Cys 


Thr 


Ala 


Gly 


Ala 


Gin 


1090 






1095 






1100 








Leu Trp Gly 


Leu 


Gly Arg Val Ala 


Ala 


Leu Glu 


Leu 


Pro 


Glu 


Val 


Trp 


1105 






1110 




1115 








1120 


Gly Gly Leu 


lie 


Asp 


Leu Pro Ala 


Arg 


Pro Asp 


Ala 


Arg 


Val 


Leu 


Asp 






1125 




1130 








1135 


Arg Leu Ala 


Gly 


Val 


Leu Ala Glu 


Pro Gly Gly 


Glu 


Asp 


Gin 


He 


Ala 




1140 




1145 






1150 




Val Arg Met 


Ala 


Gly 


Val Phe Gly 


Arg 


Arg Val 


Leu 


Arg 


Asn 


Pro 


Ala 


1155 




1160 






1165 






Asp Ser Arg 


Pro 


Pro 


Ala Trp Arg 


Ala 


Arg Gly 


Thr 


Val 


Leu 


He 


Ala 


1170 






1175 






1180 








Gly Asp Leu 


Thr 


Thr 


Val Pro Gly 


Arg 


Leu Val 


Arg 


Ser 


Leu 


Leu 


Glu 


1185 






1190 




1195 








1200 


Asp Gly Ala 


Asp 


Arg 


Val Val Leu 


Ala 


Gly Pro Asp Ala 


Pro 


Ala 


Gin 






1205 




1210 








1215 


Ala Ala Ala 


Ala Gly 


Leu Thr Gly 


Val 


Ser Leu 


Val 


Pro 


Val 


Arg 


Cys 




1220 




1225 






1230 




Asp Val Thr 


Asp 


Arg 


Ala Ala Leu 


Ala 


Ala Leu 


Leu 


Asp 


Glu 


His 


Ala 


1235 




1240 






1245 






Pro Thr Val 


Ala 


Val 


His Ala Pro 


Pro 


Leu Val 


Pro 


Leu 


Ala 


Pro 


Leu 


1250 






1255 






1260 








Arg Glu Thr 


Ala 


Pro 


Gly Asp He 


Ala 


Ala Ala 


Leu 


Ala 


Ala 


Lys 


Thr 



1265 1270 1275 1280 

Thr Ala Ala Gly His Leu Val Asp Leu Ala Pro Ala Ala Gly Leu Asp 
1285 1290 1295 

Ala Leu Val Leu Phe Ser Ser Val Ser Gly Val Trp Gly Gly Ala Ala 
1300 1305 1310 

Gin Gly Gly Tyr Ala Ala Ala Ser Ala His Leu Asp Ala Leu Ala Glu 
1315 1320 1325 

Arg Ala Arg Ala Ala Gly Val Pro Ala Phe Ser Val Ala Trp Ser Pro 
1330 1335 1340 

Trp Ala Gly Gly Thr Pro Ala Asp Gly Ala Glu Ala Glu Phe Leu Ser 
1345 1350 1355 1360 

Arg Arg Gly Leu Ala Pro Leu Asp Pro Asp Gin Ala Val Arg Thr Leu 
1365 1370 1375 

Arg Arg Met Leu Glu Arg Gly Ser Ala Cys Gly Ala Val Ala Asp Val 
1380 1385 1390 
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Glu Trp Ser Arg Phe Ala Ala Ser Tyr Thr Trp Val Arg Pro Ala Val 
1395 1400 1405 

Leu Phe Asp Asp He Pro Asp Val Gin Arg Leu Arg Ala Ala Glu Leu 
1410 1415 1420 

Ala Pro Ser Thr Gly Asp Ser Thr Thr Ser Glu Leu Val Arg Glu Leu 
1425 1430 1435 1440 

Thr Ala Gin Ser Gly His Lys Arg His Ala Thr Leu Leu Arg Leu Val 
1445 1450 1455 

Arg Ala His Ala Ala Ala Val Leu Gly Gin Ser Ser Gly Asp Ala Val 
1460 1465 1470 

Ser Ser Ala Arg Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Thr Ala 
1475 1480 1485 

Leu Glu Leu Arg Asp Arg Leu Ser Thr Ser Thr Gly Leu Lys Leu Pro 
1490 1495 1500 

Thr Ser Leu Val Phe Asp His Ser Ser Pro Ala Ala Leu Ala Arg His 
1505 1510 1515 1520 

Leu Gly Glu Glu Leu Leu Gly Arg Asn Asp Thr Ala Asp Arg Ala Gly 

1525 1530 1535 

Pro Asp Thr Pro Val Arg Thr Asp Glu Pro He Ala He He Gly Met 
1540 1545 1550 

Ala Cys Arg Leu Pro Gly Gly Val Gin Ser Pro Glu Asp Leu Trp Asp 
1555 1560 1565 

Leu Leu Thr Gly Gly Thr Asp Ala He Thr Pro Phe Pro Thr Asn Arg 
1570 1575 1580 

Gly Trp Asp Asn Glu Thr Leu Tyr Asp Pro Asp Pro Asp Ser Pro Gly 
1585 1590 1595 1600 

His His Thr Tyr Val Arg Glu Gly Gly Phe Leu His Asp Ala Ala Glu 
1605 1610 1615 

Phe Asp Pro Gly Phe Phe Gly He Ser Pro Arg Glu Ala Leu Ala Met 
1620 1625 1630 

Asp Pro Gin Gin Arg Leu He Leu Glu Thr Ser Trp Glu Ser Phe Glu 
1635 1640 1645 

Arg Ala Gly He Asp Pro Val Glu Leu Arg Gly Ser Arg Thr Gly Val 
1650 1655 1660 

Phe Val Gly Thr Asn Gly Gin His Tyr Val Pro Leu Leu Gin Asp Gly 
1665 1670 1675 1680 

Asp Glu Asn Phe Asp Gly Tyr He Ala Thr Gly Asn Ser Ala Ser Val 
1685 1690 1695 

Met Ser Gly Arg Leu Ser Tyr Val Phe Gly Leu Glu Gly Pro Ala Val 
1700 1705 1710 

Thr Val Asp Thr Ala Cys Ser Ala Ser Leu Ala Ala Leu His Leu Ala 
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1715 1720 1725 

Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Tyr Ala Leu Ala Gly Gly 
1730 1735 1740 

Ala Thr Val Met Ser Thr Pro Glu Met Leu Val Glu Phe Ala Arg Gin 
1745 1750 1755 1760 

Arg Ala Val Ser Pro Asp Gly Arg Ser Lys Ala Phe Ala Glu Ala Ala 
1765 1770 1775 

Asp Gly Val Gly Leu Ala Glu Gly Ala Gly Met Leu Leu Val Glu Arg 
1780 1785 1790 

Leu Ser Glu Ala Gin Lys Lys Gly His Pro Val Leu Ala Val Val Arg 
1795 1800 1805 

Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
1810 1815 1820 

Ser Gly Pro Ala Gin Gin Arg Val lie Arg Glu Ala Leu Ala Asp Ala 
1825 1830 1835 1840 

Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr Gly 
1845 1850 1855 

Thr Pro Leu Gly Asp Pro lie Glu Ala Gly Ala Leu Leu Ala Thr Tyr 

1860 1865 1870 

Gly Arg Asp Arg Arg Asp Gly Pro Leu Trp Leu Gly Ser Leu Lys Ser 

1875 1880 1885 

Asn He Gly His Thr Gin Ala Ala Ala Gly Val Ala Gly Val He Lys 
1890 1895 1900 

Met Val Leu Ala Leu Arg His Gly Glu Leu Pro Arg Thr Leu His Ala 
1905 1910 1915 1920 

Ser Thr Ala Ser Ser Arg He Asp Trp Asp Ala Gly Ala Val Glu Leu 
1925 1930 1935 

Leu Asp Glu Ala Arg Pro Trp Leu Gin Arg Ala Glu Gly Pro Arg Arg 
1940 1945 1950 

Ala Gly He Ser Ser Phe Gly He Ser Gly Thr Asn Ala His Leu Val 
1955 1960 1965 

He Glu Glu Pro Pro Glu Pro Thr Ala Pro Glu Leu Leu Ala Pro Glu 
1970 1975 1980 

Pro Ala Ala Asp Gly Asp Val Trp Ser Glu Glu Trp Trp His Glu Val 
1985 1990 1995 2000 

Thr Val Pro Leu Met Met Ser Ala His Asn Glu Ala Ala Leu Arg Asp 
2005 2010 2015 

Gin Ala Arg Arg Leu Arg Ala Asp Leu Leu Ala His Pro Glu Leu His 
2020 2025 2030 

Pro Ala Asp Val Gly Tyr Thr Leu He Thr Thr Arg Thr Arg Phe Glu 
2035 2040 2045 
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Gin Arg Ala Ala Val Val Gly Glu Asn Phe Thr Glu Leu He Ala Ala 
2050 2055 2060 

Leu Asp Asp Leu Val Glu Gly Arg Pro His Pro Leu Val Leu Arg Gly 
2065 2070 2075 2080 

Thr Ala Gly Thr Ser Asp Gin Val Val Phe Val Phe Pro Gly Gin Gly 
2085 2090 2095 

Ser Gin Trp Pro Glu Met Ala Asp Gly Leu Leu Ala Arg Ser Ser Gly 
2100 2105 2110 

Ser Gly Ser Phe Leu Glu Thr Ala Arg Ala Cys Asp Leu Ala Leu Arg 
2115 2120 2125 

Pro His Leu Gly Trp Ser Val Leu Asp Val Leu Arg Arg Glu Pro Gly 
2130 2135 2140 

Ala Pro Ser Leu Asp Arg Val Asp Val Val Gin Pro Val Leu Phe Thr 
2145 2150 2155 2160 

Met Met Val Ser Leu Ala Glu Thr Trp Arg Ser Leu Gly Val Glu Pro 
2165 2170 2175 

Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Tyr Val 
2180 2185 2190 

Ala Gly Ala Leu Thr Leu Asp Asp Ala Ala Arg He Val Ala Leu Arg 
2195 2200 2205 

Ser Gin Ala Trp Leu Arg Leu Ala Gly Lys Gly Gly Met Val Ala Val 

2210 2215 2220 

Thr Leu Ser Glu Arg Asp Leu Arg Pro Arg Leu Glu Pro Trp Ser Asp 
2225 2230 2235 2240 

Arg Leu Ala Val Ala Ala Val Asn Gly Pro Glu Thr Cys Ala Val Ser 
2245 2250 2255 

Gly Asp Pro Asp Ala Leu Ala Glu Leu Val Ala Glu Leu Gly Ala Glu 
2260 2265 2270 

Gly Val His Ala Arg Pro He Pro Gly Val Asp Thr Ala Gly His Ser 
2275 2280 2285 

Pro Gin Val Asp Thr Leu Glu Ala His Leu Arg Lys Val Leu Ala Pro 
2290 2295 2300 

Val Ala Pro Arg Thr Ser Asp He Pro Phe Tyr Ser Thr Val Thr Gly 
2305 2310 2315 2320 

Gly Leu He Asp Thr Ala Glu Leu Asp Ala Asp Tyr Trp Tyr Arg Asn 
2325 2330 2335 

Met Arg Glu Pro Val Glu Phe Glu Gin Ala Thr Arg Ala Leu He Ala 
2340 2345 2350 

Asp Gly His Asp Val Phe Leu Glu Ser Ser Pro His Pro Met Leu Ala 
2355 2360 2365 
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Val Ser Leu Gin Glu Thr He Ser Asp Ala Gly Ser Pro Ala Ala Val 
2370 2375 2380 

Leu Gly Thr Leu Arg Arg Gly Gin Gly Gly Pro Arg Trp Leu Gly Val 
2385 2390 2395 2400 

Ala Leu Cys Arg Ala Tyr Thr His Gly Leu Glu He Asp Ala Glu Ala 
2405 2410 2415 

He Phe Gly Pro Asp Ser Arg Gin Val Glu Leu Pro Thr Tyr Pro Phe 
2420 2425 2430 

Gin Arg Glu Arg Tyr Trp Tyr Ser Pro Gly His Arg Gly Asp Asp Pro 
2435 2440 2445 

Ala Ser Leu Gly Leu Asp Ala Val Asp His Pro Leu Leu Gly Ser Gly 
2450 2455 2460 

Val Glu Leu Pro Glu Ser Gly Asp Arg Met Tyr Thr Ala Arg Leu Gly 
2465 2470 2475 2480 

Ala Asp Thr Thr Pro Trp Leu Ala Asp His Ala Leu Leu Gly Ser Pro 
2485 2490 2495 

Leu Leu Pro Gly Ala Ala Phe Ala Asp Leu Ala Leu Trp Ala Gly Arg 
2500 2505 2510 

Gin Ala Gly Thr Gly Arg Val Glu Glu Leu Thr Leu Ala Ala Pro Leu 
2515 2520 2525 

Val Leu Pro Gly Ser Gly Gly Val Arg Leu Arg Leu Asn Val Gly Ala 
2530 2535 2540 

Pro Gly Thr Asp Asp Ala Arg Arg Phe Ala Val His Ala Arg Ala Glu 
2545 2550 2555 2560 

Gly Ala Thr Asp Trp Thr Leu His Ala Glu Gly Leu Leu Thr Ala Gin 

2565 2570 2575 

Asp Thr Ala Asp Ala Pro Asp Ala Ser Ala Ala Thr Pro Pro Pro Gly 
2580 2585 2590 

Ala Glu Gin Leu Asp He Gly Asp Phe Tyr Gin Arg Phe Ser Glu Leu 
2595 2600 2605 

Gly Tyr Gly Tyr Gly Pro Phe Phe Arg Gly Leu Val Ser Ala His Arg 
2610 2615 2620 

Cys Gly Pro Asp He His Ala Glu Val Ala Leu Pro Val Gin Ala Gin 
2625 2630 2635 2640 

Gly Asp Ala Ala Arg Phe Gly He His Pro Ala Leu Leu Asp Ala Ala 
2645 2650 2655 

Leu Gin Thr Met Ser Leu Gly Gly Phe Phe Pro Glu Asp Gly Arg Val 
2660 2665 2670 

Arg Met Pro Phe Ala Leu Arg Gly Val Arg Leu Tyr Arg Ala Gly Ala 
2675 2680 2685 

Asp Arg Leu His Val Arg Val Ser Pro Val Ser Glu Asp Ala Val Arg 
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2690 2695 2700 

lie Arg Cys Ala Asp Gly Glu Gly Arg Pro Val Ala Glu lie Glu Ser 
2705 2710 2715 2720 

Phe lie Met Arg Pro Val Asp Pro Gly Gin Leu Leu Gly Gly Arg Pro 
2725 2730 2735 

Val Gly Ala Asp Ala Leu Phe Arg lie Ala Trp Arg Glu Leu Ala Ala 
2740 2745 2750 

Gly Pro Gly Thr Arg Thr Gly Asp Gly Thr Pro Pro Pro Val Arg Trp 
2755 2760 2765 

Val Leu Ala Gly Pro Asp Ala Leu Gly Leu Ala Glu Ala Ala Asp Ala 
2770 2775 2780 

His Leu Pro Ala Val Pro Gly Pro Asp Gly Ala Leu Pro Ser Pro Thr 
2785 2790 2795 2800 

Gly Arg Pro Ala Pro Asp Ala Val Val Phe Ala Val Arg Ala Gly Thr 
2805 2810 2815 

Gly Asp Val Ala Ala Asp Ala His Thr Val Ala Cys Arg Val Leu Asp 
2820 2825 2830 

Leu Val Gin Arg Arg Leu Ala Ala Pro Glu Gly Pro Asp Gly Ala Arg 
2835 2840 2845 

Leu Val Val Ala Thr Arg Gly Ala Val Ala Val Arg Asp Asp Ala Glu 
2850 2855 2860 

Val Asp Asp Pro Ala Ala Ala Ala Ala Trp Gly Leu Leu Arg Ser Ala 
2865 2870 2875 2880 

Gin Ala Glu Glu Pro Gly Arg Phe Leu Leu Val Asp Leu Asp Asp Asp 
2885 2890 2895 

Pro Ala Ser Ala Arg Ala Leu Thr Asp Ala Leu Ala Ser Gly Glu Pro 
2900 2905 2910 

Gin Thr Ala Val Arg Ala Gly Thr Val Tyr Val Pro Arg Leu Glu Arg 

2915 2920 2925 

Ala Ala Asp Arg Thr Asp Gly Pro Leu Thr Pro Pro Asp Asp Gly Ala 
2930 2935 2940 

Trp Arg Leu Gly Arg Gly Thr Asp Leu Thr Leu Asp Gly Leu Ala Leu 
2945 2950 2955 2960 

Val Pro Ala Pro Asp Ala Glu Ala Pro Leu Glu Pro Gly Gin Val Arg 
2965 2970 2975 

Val Ala Val Arg Ala Ala Gly Val Asn Phe Arg Asp Ala Leu He Ala 
2980 2985 2990 

Leu Gly Met Tyr Pro Gly Glu Ala Glu Met Gly Thr Glu Gly Ala Gly 
2995 3000 3005 

Thr Val Val Glu Val Gly Pro Gly Val Thr Gly Val Ala Val Gly Asp 
3010 3015 3020 
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Arg Val Leu Gly Leu Trp Asp Gly Gly Leu Gly Pro Leu Cys Val Ala 
3025 3030 3035 3040 

Asp His Arg Leu Leu Ala Pro Val Pro Asp Gly Trp Ser Tyr Ala Gin 
3045 3050 3055 

Ala Ala Ser Val Pro Ala Val Phe Leu Ser Ala Tyr Tyr Gly Leu Val 
3060 3065 3070 

Thr Leu Ala Gly Leu Arg Pro Gly Glu Arg Val Leu Val His Ala Ala 
3075 3080 3085 

Ala Gly Gly Val Gly Met Ala Ala Val Gin He Ala Arg His Leu Gly 
3090 3095 3100 

Ala Glu Val Leu Ala Thr Ala Ser Pro Gly Lys Trp Asp Ala Leu Arg 
3105 3110 3115 3120 

Ala Met Gly He Thr Asp Asp His Leu Ala Ser Ser Arg Thr Leu Asp 
3125 3130 3135 

Phe Ala Thr Ala Phe Thr Gly Ala Asp Gly Thr Ser Arg Ala Asp Val 
3140 3145 3150 

Val Leu Asn Ser Leu Thr Lys Glu Phe Val Asp Ala Ser Leu Gly Leu 
3155 3160 3165 

Leu Arg Pro Gly Gly Arg Phe Leu Glu Leu Gly Lys Thr Asp Val Arg 
3170 3175 3180 

Asp Pro Glu Arg He Ala Ala Glu His Pro Gly Val Arg Tyr Arg Ala 
3185 3190 3195 3200 

Phe Asp Leu Asn Glu Ala Gly Pro Asp Ala Leu Gly Arg Leu Leu Arg 
3205 3210 3215 

Glu Leu Met Asp Leu Phe Ala Ala Gly Val Leu His Pro Leu Pro Val 
3220 3225 3230 

Val Thr His Asp Val Arg Arg Ala Ala Asp Ala Leu Arg Thr He Ser 
3235 3240 3245 

Gin Ala Arg His Thr Gly Lys Leu Val Leu Thr Met Pro Pro Ala Trp 
3250 3255 3260 

His Pro Tyr Gly Thr Val Leu Val Thr Gly Gly Thr Gly Ala Leu Gly 
3265 3270 3275 3280 

Ser Arg He Ala Arg His Leu Ala Ser Arg His Gly Val Arg Arg Leu 
3285 3290 3295 

Leu He Ala Ala Arg Arg Gly Pro Asp Gly Glu Gly Ala Ala Glu Leu 
3300 3305 3310 

Val Ala Asp Leu Ala Ala Leu Gly Ala Ser Ala Thr Val Val Ala Cys 
3315 3320 3325 

Asp Val Ser Asp Ala Asp Ala Val Arg Gly Leu Leu Ala Gly He Pro 
3330 3335 3340 
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Ala Asp His Pro Leu Thr Ala Val Val His Ser Thr Gly Val Leu Asp 

3345 3350 3355 3360 

Asp Gly Val Leu Pro Gly Leu Thr Pro Glu Arg Met Arg Arg Val Leu 

3365 3370 3375 

Arg Pro Lys Val Glu Ala Ala Val His Leu Asp Glu Leu Thr Arg Asp 

3380 3385 3390 

Leu Asp Leu Ser Ala Phe Val Leu Phe Ser Ser Ser Ala Gly Leu Leu 

3395 3400 3405 

Gly Ser Pro Ala Gin Gly Asn Tyr Ala Ala Ala Asn Ala Thr Leu Asp 
3410 3415 3420 

Ala Leu Ala Ala Arg Arg Arg Ser Leu Gly Leu Pro Ser Val Ser Leu 

3425 3430 3435 3440 

Ala Trp Gly Leu Trp Ser Asp Thr Ser Arg Met Ala His Ala Leu Asp 

3445 3450 3455 

Gin Glu Ser Leu Gin Arg Arg Phe Ala Arg Ser Gly Phe Pro Pro Leu 

3460 3465 3470 

Ser Ala Thr Leu Gly Ala Ala Leu Phe Asp Ala Ala Leu Arg Val Asp 

3475 3480 3485 

Glu Ala Val Gin Val Pro Met Arg Phe Asp Pro Ala Ala Leu Arg Ala 
3490 3495 3500 

Thr Gly Ser Val Pro Ala Leu Leu Ser Asp Leu Val Gly Ser Ala Pro 

3505 3510 3515 3520 

Ala Thr Gly Ser Ala Ala Pro Ala Ser Gly Pro Leu Pro Ala Pro Asp 

3525 3530 3535 

Ala Gly Thr Val Gly Glu Pro Leu Ala Glu Arg Leu Ala Gly Leu Ser 

3540 3545 3550 

Ala Glu Glu Arg His Asp Arg Leu Leu Gly Leu Val Gly Glu His Val 

3555 3560 3565 

Ala Ala Val Leu Gly His Gly Ser Ala Ala Glu Val Arg Pro Asp Arg 
3570 3575 3580 

Pro Phe Arg Glu Val Gly Phe Asp Ser Leu Thr Ala Val Glu Leu Arg 

3585 3590 3595 3600 

Asn Arg Met Ala Ala Val Thr Gly Val Arg Leu Pro Ala Thr Leu Val 

3605 3610 3615 

Phe Asp His Pro Thr Pro Ala Ala Leu Ser Ser His Leu Asp Gly Leu 

3620 3625 3630 

Leu Ala Pro Ala Gin Pro Val Thr Thr Thr Pro Leu Leu Ser Glu Leu 

3635 3640 3645 

Asp Arg lie Glu Glu Ala Leu Ala Ala Leu Thr Pro Glu His Leu Ala 
3650 3655 3660 

Glu Leu Ala Pro Ala Pro Asp Asp Arg Ala Glu Val Ala Leu Arg Leu 
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3665 3670 3675 3680 

Asp Ala Leu Ala Asp Arg Trp Arg Ala Leu His Asp Gly Ala Pro Gly 
3685 3690 3695 

5 

Ala Asp Asp Asp lie Thr Asp Val Leu Ser Ser Ala Asp Asp Asp Glu 
3700 3705 3710 

lie Phe Ala Phe He Asp Glu Arg Tyr Gly Thr Ser 
10 3715 3720 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 
75 (A) LENGTH: 1580 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : peptide 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



30 



35 



45 



55 



Met Ala 
1 


Asn 


Glu 


Glu 
5 


Lys 


Leu 


Arg 


Ala 


Tyr 
10 


Leu 


Lys 


Arg 


Val 


Thr Gly 
15 


Glu Leu 


His 


Arg 
20 


Ala 


Thr 


Glu 


Gin 


Leu 
25 


Arg 


Ala 


Leu 


Asp 


Arg 
30 


Arg Ala 


His Glu 


Pro 
35 


He 


Ala 


He 


Val 


Gly 
40 


Ala 


Ala 


Cys 


Arg 


Leu 
45 


Pro 


Gly Gly 


Val Glu 
50 


Ser 


Pro 


Asp 


Asp 


Leu 
55 


Trp 


Glu 


Leu 


Leu 


His 
60 


Ala Gly 


Ala Asp 


Ala Val 
65 


Gly 


Pro 


Ala 


Pro 
70 


Ala 


Asp 


Arg 


Gly 


Trp 

75 


Asp Val 


Glu 


Gly Arg 
80 


Tyr Ser 


Pro 


Asp 


Pro 
85 


Asp 


Thr 


Pro 


Gly 


Thr 
90 


Ser 


Tyr 


Cys 


Arg 


Glu Gly 
95 


Gly Phe 


Val 


Gin 
100 


Gly 


Ala 


Asp 


Arg 


Phe 
105 


Asp 


Pro 


Ala 


Leu 


Phe 
110 


Gly He 


Ser Pro 


Asn 

115 


Glu 


Ala 


Leu 


Thr 


Met 
120 


Asp 


Pro 


Gin 


Gin 


Arg 
125 


Leu 


Leu Leu 


Glu Thr 
130 


Ser 


Trp 


Glu 


Ala 


Leu 
135 


Glu 


Arg 


Ala 


Gly 


Leu 
140 


Asp 


Pro 


Gin Ser 


Leu Ala Gly 
145 


Ser 


Arg 


Thr 
150 


Gly 


Val 


Phe 


Ala 


Gly Ala 
155 


Trp Glu 


Ser Gly 
160 


Tyr Gin Lys 
Leu Ala Gly 


Gly 

He 
180 


Val 
165 
Val 


Glu 
Ser 


Gly 
Phe 


Leu 
Thr 


Glu 

Ala 
185 


Ala 
170 
Gly 


Asp Leu 
Arg Val 


Glu Ala 

Ala Tyr 
190 


Gin Leu 
175 

Ala Leu 


Gly Leu Glu 


Gly 


Pro 


Ala 


Leu 


Thr 


He 


Asp 


Thr 


Ala 


Cys 


Ser 


Ser Ser 
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195 



200 



205 



Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys 
210 215 220 

Asp Leu Ala Leu Ala Gly Gly Ala Thr Val lie Ala Asp Phe Ala Leu 
225 230 235 240 

Phe Thr Gin Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys 
245 250 255 

Lys Ala Phe Gly Glu Thr Ala Asp Gly Phe Gly Pro Ala Glu Gly Ala 
260 265 270 

Gly Met Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His 
275 280 285 

Pro Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala 
290 295 300 

Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg Val lie 
305 310 315 320 

Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val Asp Ala 
325 330 335 

Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro He Glu Ala 
340 345 350 

Gly Ala Leu Met Ala Thr Tyr Gly His Glu Arg Thr Gly Asp Pro Leu 
355 360 365 

Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Thr Gin Ala Ala Ala 
370 375 380 

Gly Val Ala Gly Val He Lys Met Val Leu Ala Leu Arg His Gly Glu 
385 390 395 400 

Leu Pro Arg Thr Leu His Ala Ser Thr Ala Ser Ser Arg He Glu Trp 
405 410 415 

Asp Ala Gly Ala Val Glu Leu Leu Asp Glu Ala Arg Pro Trp Pro Arg 
420 425 430 

Arg Ala Glu Gly Pro Arg Arg Ala Gly He Ser Ser Phe Gly He Ser 
435 440 445 

Gly Thr Asn Ala His Leu Val He Glu Glu Glu Pro Pro Ala Arg Pro 
450 455 460 

Glu Pro Glu Glu Ala Ala Gin Pro Pro Ala Pro Ala Thr Thr Val Leu 
465 470 475 480 

Pro Leu Ser Ala Ala Gly Ala Arg Ser Leu Arg Glu Gin Ala Arg Arg 
485 490 495 

Leu Ala Ala His Leu Ala Gly His Glu Glu He Thr Ala Ala Asp Ala 
500 505 510 



Ala Arg Ser Ala Ala Thr Thr Arg Ala Ala Leu Ser His Arg Ala Ser 



168 



EP 0 791 655 A2 



515 520 525 

Val Leu Ala Asp Asp Arg Arg Ala Leu He Asp Arg Leu Thr Ala Leu 
530 535 540 

Ala Glu Asp Arg Lys Asp Pro Gly Val Thr Val Gly Glu Ala Gly Ser 
545 550 555 560 

Gly Arg Pro Pro Val Phe Val Phe Pro Gly Gin Gly Ser Gin Trp Thr 
565 570 575 

Gly Met Gly Ala Glu Leu Leu Asp Arg Ala Pro Val Phe Arg Ala Lys 
580 585 590 

Ala Glu Glu Cys Ala Arg Ala Leu Ala Ala His Leu Asp Trp Ser Val 
595 600 605 

Leu Asp Val Leu Arg Asp Ala Pro Gly Ala Pro Pro He Asp Arg Ala 
610 615 620 

Asp Val Val Gin Pro Thr Leu Phe Thr Met Met Val Ser Leu Ala Ala 
625 630 635 640 

Leu Trp Glu Ser His Gly Val Arg Pro Ala Ala Val Val Gly His Ser 
645 650 655 

Gin Gly Glu He Ala Ala Ala His Ala Ala Gly Ala Leu Ser Leu Asp 
660 665 670 

Asp Ala Ala Arg Val He Ala Glu Arg Ser Arg Leu Trp Lys Arg Leu 
675 680 685 

Ala Gly Asn Gly Gly Met Leu Ser Val Met Ala Pro Ala Asp Arg Val 
690 695 700 

Arg Glu Leu Met Glu Pro Trp Ala Glu Arg Met Ser Val Ala Ala Val 

705 710 715 720 

Asn Gly Pro Ala Ser Val Thr Val Ala Gly Asp Ala Arg Ala Leu Glu 
725 730 735 

Glu Phe Gly Gly Arg Leu Ser Ala Ala Gly Val Leu Arg Trp Pro Leu 
740 745 750 

Ala Gly Val Asp Phe Ala Gly His Ser Pro Gin Val Glu Gin Phe Arg 
755 760 765 

Ala Glu Leu Leu Asp Thr Leu Gly Thr Val Arg Pro Thr Ala Ala Arg 
770 775 780 

Leu Pro Phe Phe Ser Thr Val Thr Ala Ala Ala His Glu Pro Glu Gly 
785 790 795 800 

Leu Asp Ala Ala Tyr Trp Tyr Arg Asn Met Arg Glu Pro Val Glu Phe 
805 810 815 

Ala Ser Thr Leu Arg Thr Leu Leu Arg Glu Gly His Arg Thr Phe Val 
820 825 830 



Glu Met Gly Pro His Pro Leu Leu Gly Ala Ala He Asp Glu Val Ala 
835 840 845 
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Glu Ala Glu Gly Val His Ala Thr Ala Leu Ala Thr Leu His Arg Gly 
850 855 860 

Ser Gly Gly Leu Asp Arg Phe Arg Ser Ser Val Gly Ala Ala Phe Ala 

865 870 875 880 

His Gly Val Arg Val Asp Trp Asp Ala Leu Phe Glu Gly Ser Gly Ala 

885 890 895 

Arg Arg Val Pro Leu Pro Thr Tyr Ala Phe Ser Arg Asp Arg Tyr Trp 
900 905 910 

Leu Pro Thr Ala He Gly Arg Arg Ala Val Glu Ala Ala Pro Val Asp 
915 920 925 

Ala Ser Ala Pro Gly Arg Tyr Arg Val Thr Trp Thr Pro Val Ala Ser 
930 935 940 

Asp Asp Ser Gly Arg Pro Ser Gly Arg Trp Leu Leu Val Gin Thr Pro 
945 950 955 960 

Gly Thr Ala Pro Asp Glu Ala Asp Thr Ala Ala Ser Ala Leu Gly Ala 

965 970 975 

Ala Gly Val Val Val Glu Arg Cys Leu Leu Asp Pro Thr Glu Ala Ala 
980 985 990 

Arg Val Thr Leu Thr Glu Arg Leu Ala Glu Leu Asp Ala Gin Pro Glu 
995 1000 1005 

Gly Leu Ala Gly Val Leu Val Leu Pro Gly Arg Pro Gin Ser Thr Ala 
1010 1015 1020 

Pro Ala Asp Ala Ser Pro Leu Asp Pro Gly Thr Ala Ala Val Leu Leu 
1025 1030 1035 1040 

Val Val Gin Ala Val Pro Asp Ala Ala Pro Lys Ala Arg He Trp Val 
1045 1050 1055 

Val Thr Arg Gly Ala Val Ala Val Gly Ser Gly Glu Val Pro Cys Ala 
1060 1065 1070 

Val Gly Ala Arg Val Trp Gly Leu Gly Arg Val Ala Ala Leu Glu Val 
1075 1080 1085 

Pro Val Gin Trp Gly Gly Leu Val Asp Val Ala Val Gly Ala Gly Val 
1090 1095 1100 

Arg Glu Trp Arg Arg Val Val Gly Val Val Ala Gly Gly Gly Glu Asp 
1105 1110 1115 1120 

Gin Val Ala Val Arg Gly Gly Gly Val Phe Gly Arg Arg Leu Val Gly 
1125 1130 1135 

Val Gly Val Arg Gly Gly Ser Gly Val Trp Arg Ala Arg Gly Cys Val 
1140 1145 1150 

Val Val Thr Gly Gly Leu Gly Gly Val Gly Gly His Val Ala Arg Trp 
1155 1160 1165 
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Leu Ala Arg Ser Gly Ala Glu His Val Val Leu Ala Gly Arg Arg Gly 
1170 1175 1180 

Gly Gly Val Val Gly Ala Val Glu Leu Glu Arg Glu Leu Val Gly Leu 
1185 1190 1195 1200 

Gly Ala Lys Val Thr Phe Val Ser Cys Asp Val Gly Asp Arg Ala Ser 
1205 1210 1215 

Met Val Gly Leu Leu Gly Val Val Glu Gly Leu Gly Val Pro Leu Arg 

1220 1225 1230 

Gly Val Phe His Ala Ala Gly Val Ala Gin Val Ser Gly Leu Gly Glu 
1235 1240 1245 

Val Ser Leu Ala Glu Ala Gly Gly Val Leu Gly Gly Lys Ala Val Gly 
1250 1255 1260 

Ala Glu Leu Leu Asp Glu Leu Thr Ala Gly Val Glu Leu Asp Ala Phe 
1265 1270 1275 1280 

Val Leu Phe Ser Ser Gly Ala Gly Val Trp Gly Ser Gly Gly Gin Ser 
1285 1290 1295 

Val Tyr Ala Ala Ala Asn Ala His Leu Asp Ala Leu Ala Glu Arg Arg 
1300 1305 1310 

Arg Ala Gin Gly Arg Pro Ala Thr Ser Val Ala Trp Gly Leu Trp Gly 
1315 1320 1325 

Gly Glu Gly Met Gly Ala Asp Glu Gly Val Thr Glu Phe Tyr Ala Glu 
1330 1335 1340 

Arg Gly Leu Ala Pro Met Arg Pro Glu Ser Gly lie Glu Ala Leu His 
1345 1350 1355 1360 

Thr Ala Leu Asn Glu Gly Asp Thr Cys Val Thr Val Ala Asp lie Asp 
1365 1370 1375 

Trp Glu His Phe Val Thr Gly Phe Thr Ala Tyr Arg Pro Ser Pro Leu 
1380 1385 1390 

lie Ser Asp lie Pro Gin Val Arg Ala Leu Arg Thr Pro Glu Pro Thr 
1395 1400 1405 

Val Asp Ala Ser Asp Gly Leu Arg Arg Arg Val Asp Ala Ala Leu Thr 
1410 1415 1420 

Pro Arg Glu Arg Thr Lys Val Leu Val Asp Leu Val Arg Thr Val Ala 
1425 1430 1435 1440 

Ala Glu Val Leu Gly His Asp Gly He Gly Gly He Gly His Asp Val 
1445 1450 1455 

Ala Phe Arg Asp Leu Gly Phe Asp Ser Leu Ala Ala Val Arg Met Arg 
1460 1465 1470 

Gly Arg Leu Ala Glu Ala Thr Gly Leu Val Leu Pro Ala Thr Val He 
1475 1480 1485 

Phe Asp His Pro Thr Val Asp Arg Leu Gly Gly Ala Leu Leu Glu Arg 



171 



EP 0 791 655 A2 



1490 1495 1500 

Leu Ser Ala Asp Glu Pro Ala Pro Gly Gly Ala Pro Glu Pro Ala Gly 
1505 1510 1515 1520 

Gly Arg Pro Ala Thr Pro Pro Pro Ala Pro Glu Pro Ala Val His Asp 
1525 1530 1535 

Ala Asp lie Asp Glu Leu Asp Ala Asp Ala Leu lie Arg Leu Ala Thr 
1540 1545 1550 

Gly Thr Ala Gly Pro Ala Asp Gly Thr Pro Ala Asp Gly Gly Pro Asp 
1555 1560 1565 

Ala Ala Ala Thr Ala Pro Asp Gly Ala Pro Glu Gin 
1570 1575 1580 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1891 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

<ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Ser Pro Ser Met Asp Glu Val Leu Gly Ala Leu Arg Thr Ser Val 
15 10 15 

Lys Glu Thr Glu Arg Leu Arg Arg His Asn Arg Glu Leu Leu Ala Gly 
20 25 30 

Ala His Glu Pro Val Ala lie Val Gly Met Ala Cys Arg Tyr Pro Gly 
35 40 45 

Gly Val Ser Thr Pro Asp Asp Leu Trp Glu Leu Ala Ala Asp Gly Val 
50 55 60 

Asp Ala lie Thr Pro Phe Pro Ala Asp Arg Gly Trp Asp Glu Asp Ala 
65 70 75 80 

Val Tyr Ser Pro Asp Pro Asp Thr Pro Gly Thr Thr Tyr Cys Arg Glu 
85 90 95 

Gly Gly Phe Leu Thr Gly Ala Gly Asp Phe Asp Ala Ala Phe Phe Gly 
100 105 110 

He Ser Pro Asn Glu Ala Leu Val Met Asp Pro Gin Gin Arg Leu Leu 
115 120 125 

Leu Glu Thr Ser Trp Glu Thr Leu Glu Arg Ala Gly He Val Pro Ala 
130 135 140 

Ser Leu Arg Gly Ser Arg Thr Gly Val Phe Val Gly Ala Ala His Thr 
145 150 155 160 
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Gly Tyr Val Thr Asp Thr Ala Arg Ala Pro Glu Gly Thr Glu Gly Tyr 
165 170 175 

Leu Leu Thr Gly Asn Ala Asp Ala Val Met Ser Gly Arg lie Ala Tyr 
180 185 190 

Ser Leu Gly Leu Glu Gly Pro Ala Leu Thr lie Gly Thr Ala Cys Ser 
195 200 205 

Ser Ser Leu Val Ala Leu His Leu Ala Val Gin Ser Leu Arg Arg Gly 
210 215 220 

Glu Cys Asp Leu Ala Leu Ala Gly Gly Val Ala Val Met Pro Asp Pro 
225 230 235 240 

Thr Val Phe Val Glu Phe Ser Arg Gin Arg Gly Leu Ala Val Asp Gly 
245 250 255 

Arg Cys Lys Ala Phe Ala Glu Gly Ala Asp Gly Thr Ala Trp Ala Glu 

260 265 270 

Gly Val Gly Val Leu Leu Val Glu Arg Leu Ser Asp Ala Arg Arg Asn 
275 280 285 

Gly Hi3 Arg Val Leu Ala Val Val Arg Gly Ser Ala Val Asn Gin Asp 
290 295 300 

Gly Ala Ser Asn Gly Leu Thr Ala Pro Ser Gly Pro Ala Gin Gin Arg 
305 310 315 320 

Val lie Arg Glu Ala Leu Ala Asp Ala Gly Leu Thr Pro Ala Asp Val 
325 330 335 

Asp Val Val Glu Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He 
340 345 350 

Glu Ala Gly Ala Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp 
355 360 365 

Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn He Gly His Ala Gin Ala 
370 375 380 

Ala Ala Gly Val Gly Gly Val He Lys Val Val Gin Ala Met Arg His 
385 390 395 400 

Gly Ser Leu Pro Arg Thr Leu His Val Asp Ala Pro Ser Ser Lys Val 
405 410 415 

Glu Trp Ala Ser Gly Ala Val Glu Leu Leu Thr Glu Gly Arg Ser Trp 
420 425 430 

Pro Arg Arg Val Glu Arg Val Arg Arg Ala Ala Val Ser Ala Phe Gly 
435 440 445 

Val Ser Gly Thr Asn Ala His Val Val Leu Glu Glu Ala Pro Val Glu 
450 455 460 

Ala Gly Ser Glu His Gly Asp Gly Pro Gly Pro Asp Arg Pro Asp Ala 
465 470 475 480 



Val Thr Gly Pro Leu Pro Trp Val Leu Ser Ala Arg Ser Arg Glu Ala 
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485 



490 



495 



Leu Arg Gly Gin Ala Gly Arg Leu Ala Ala Leu Ala Arg Gin Gly Arg 
500 505 510 

Thr Glu Gly Thr Gly Gly Gly Ser Gly Leu Val Val Pro Ala Ala Asp 
515 520 525 

lie Gly Tyr Ser Leu Ala Thr Thr Arg Glu Thr Leu Glu His Arg Ala 
530 535 540 

Val Ala Leu Val Gin Glu Asn Arg Thr Ala Gly Glu Asp Leu Ala Ala 
545 550 555 560 

Leu Ala Ala Gly Arg Thr Pro Glu Ser Val Val Thr Gly Val Ala Arg 
565 570 575 

Arg Gly Arg Gly He Ala Phe Leu Cys Ser Gly Gin Gly Ala Gin Arg 
580 585 590 

Leu Gly Ala Gly Arg Glu Leu Arg Gly Arg Phe Pro Val Phe Ala Asp 
595 600 605 

Ala Leu Asp Glu He Ala Ala Glu Phe Asp Ala His Leu Glu Arg Pro 

610 615 620 

Leu Leu Ser Val Met Phe Ala Glu Pro Ala Thr Pro Asp Ala Ala Leu 

625 630 635 640 

Leu Asp Arg Thr Asp Tyr Thr Gin Pro Ala Leu Phe Ala Val Glu Thr 
645 650 655 

Ala Leu Phe Arg Leu Leu Glu Ser Trp Gly Leu Val Pro Asp Val Leu 
660 665 670 

Val Gly His Ser He Gly Gly Leu Val Ala Ala His Val Ala Gly Val 
675 680 685 

Phe Ser Ala Ala Asp Ala Ala Arg Leu Val Ser Ala Arg Gly Arg Leu 
690 695 700 

Met Arg Ala Leu Pro Glu Gly Gly Ala Met Ala Ala Val Gin Ala Thr 
705 710 715 720 

Glu Arg Glu Ala Ala Ala Leu Glu Pro Val Ala Ala Gly Gly Ala Val 
725 730 735 

Val Ala Ala Val Asn Gly Pro Gin Ala Leu Val Leu Ser Gly Asp Glu 
740 745 750 

Ala Ala Val Leu Ala Ala Ala Gly Glu Leu Ala Ala Arg Gly Arg Arg 
755 760 765 

Thr Lys Arg Leu Arg Val Ser His Ala Phe His Ser Pro Arg Met Asp 

770 775 780 

Ala Met Leu Ala Asp Phe Arg Ala Val Ala Asp Thr Val Asp Tyr His 
785 790 795 800 



Ala Pro Arg Leu Pro Val Val Ser Glu Val Thr Gly Asp Leu Ala Asp 
805 810 815 
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Ala Ala Gin Leu Thr Asp Pro Gly Tyr Trp Thr Arg Gin Val Arg Gin 
820 825 830 

Pro Val Arg Phe Ala Asp Ala Val Arg Thr Ala Ser Ala Arg Asp Ala 
835 840 845 

Ala Thr Phe He Glu Leu Gly Pro Asp Ala Val Leu Cys Gly Met Ala 
850 855 860 

Glu Glu Ser Leu Ala Ala Glu Ala Asp Val Val Phe Ala Pro Ala Leu 
865 870 875 880 

Arg Arg Gly Arg Pro Glu Gly Asp Thr Val Leu Arg Ala Ala Ala Ser 
885 890 895 

Ala Tyr Val Arg Gly Ala Gly Leu Asp Trp Ala Ala Leu Tyr Gly Gly 
900 905 910 

Thr Gly Ala Arg Arg Thr Asp Leu Pro Thr Tyr Ala Phe Gin His Ser 
915 920 925 

Arg Tyr Trp Leu Ala Pro Ala Ser Ala Ala Val Ala Pro Ala Thr Ala 
930 935 940 

Ala Pro Ser Val Arg Ser Val Pro Glu Ala Glu Gin Asp Gly Ala Leu 
945 950 955 960 

Trp Ala Ala Val His Ala Gly Asp Val Ala Ser Ala Ala Ala Arg Leu 

965 970 975 

Gly Ala Asp Asp Ala Gly He Glu His Glu Leu Arg Ala Val Leu Pro 
980 985 990 

His Leu Ala Ala Trp His Asp Arg Asp Arg Ala Thr Ala Arg Thr Ala 
995 1000 1005 

Gly Leu His Tyr Arg Val Thr Trp Gin Ala He Glu Ala Asp Ala Val 
1010 1015 1020 

Arg Phe Ser Pro Ser Asp Arg Trp Leu Met Val Glu His Gly Gin His 
1025 1030 1035 1040 

Thr Glu Cys Ala Asp Ala Ala Glu Arg Ala Leu Arg Ala Ala Gly Ala 
1045 1050 1055 

Glu Val Thr Arg Leu Val Trp Pro Leu Glu Gin His Thr Gly Ser Pro 
1060 1065 1070 

Arg Thr Glu Thr Pro Asp Arg Gly Thr Leu Ala Ala Arg Leu Ala Glu 
1075 1080 1085 

Leu Ala Arg Ser Pro Glu Gly Leu Ala Gly Val Leu Leu Leu Pro Asp 
1090 1095 1100 

Ser Gly Gly Ala Ala Val Ala Gly His Pro Gly Leu Asp Gin Gly Thr 
1105 1110 1115 1120 

Ala Ala Val Leu Leu Thr He Gin Ala Leu Thr Asp Ala Ala Val Arg 
1125 1130 1135 
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Ala Pro Leu Trp Val Val Thr Arg Gly Ala Val Ala Val Gly Ser Gly 
1140 1145 1150 

Glu Val Pro Cys Ala Val Gly Ala Arg Val Trp Gly Leu Gly Arg Val 
1155 1160 1165 

Ala Ala Leu Glu Val Pro Val Gin Trp Gly Gly Leu Val Asp Val Ala 
1170 1175 1180 

Val Gly Ala Gly Val Arg Glu Trp Arg Arg Val Val Gly Val Val Ala 
1185 1190 1195 1200 

Gly Gly Gly Glu Asp Gin Val Ala Val Arg Gly Gly Gly Val Phe Gly 
1205 1210 1215 

Arg Arg Leu Val Gly Val Gly Val Arg Gly Gly Ser Gly Val Trp Arg 
1220 1225 1230 

Ala Arg Gly Cys Val Val Val Thr Gly Gly Leu Gly Gly Val Gly Gly 
1235 1240 1245 

His Val Ala Arg Trp Leu Ala Arg Ser Gly Ala Glu His Val Val Leu 
1250 1255 1260 

Ala Gly Arg Arg Gly Gly Gly Val Val Gly Ala Val Glu Leu Glu Arg 
1265 1270 1275 1280 

Glu Leu Val Gly Leu Gly Ala Lys Val Thr Phe Val Ser Cys Asp Val 
1285 1290 1295 

Gly Asp Arg Ala Ser Val Val Gly Leu Leu Gly Val Val Glu Gly Leu 
1300 1305 1310 

Gly Val Pro Leu Arg Gly Val Phe His Ala Ala Gly Val Ala Gin Val 

1315 1320 1325 

Ser Gly Leu Gly Glu Val Ser Leu Ala Glu Ala Gly Gly Val Leu Gly 
1330 1335 1340 

Gly Lys Ala Val Gly Ala Glu Leu Leu Asp Glu Leu Thr Ala Gly Val 
1345 1350 1355 1360 

Glu Leu Asp Ala Phe Val Leu Phe Ser Ser Gly Ala Gly Val Trp Gly 
1365 1370 1375 

Ser Gly Gly Gin Ser Val Tyr Ala Ala Ala Asn Ala His Leu Asp Ala 
1380 1385 1390 

Leu Ala Glu Arg Arg Arg Ala Gin Gly Arg Pro Ala Thr Ser Val Ala 
1395 1400 1405 

Trp Gly Pro Trp Asp Gly Asp Gly Met Gly Glu Met Ala Pro Glu Gly 
1410 1415 1420 

Tyr Phe Ala Arg His Gly Val Ala Pro Leu His Pro Glu Thr Ala Leu 
1425 1430 1435 1440 

Thr Ala Leu His Gin Ala He Asp Gly Gly Glu Ala Thr Val Thr Val 
1445 1450 1455 

Ala Asp He Asp Trp Glu Arg Phe Ala Pro Gly Phe Thr Ala Phe Arg 
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1460 1465 1470 

Pro Ser Pro Leu He Ala Gly He Pro Ala Ala Arg Thr Ala Pro Ala 
1475 1480 1485 

Ala Gly Arg Pro Ala Glu Asp Thr Pro Thr Ala Pro Gly Leu Leu Arg 
1490 1495 1500 

Ala Arg Pro Glu Asp Arg Pro Arg Leu Ala Leu Asp Leu Val Leu Arg 
1505 1510 1515 1520 

His Val Ala Ala Val Leu Gly His Ser Glu Asp Ala Arg Val Asp Ala 
1525 1530 1535 

Arg Ala Pro Phe Arg Asp Leu Gly Phe Asp Ser Leu Ala Ala Val Arg 
1540 1545 1550 

Leu Arg Arg Arg Leu Ala Glu Asp Thr Gly Leu Asp Leu Pro Gly Thr 
1555 1560 1565 

Leu Val Phe Asp His Glu Asp Pro Thr Ala Leu Ala His His Leu Ala 
1570 1575 1580 

Gly Leu Ala Asp Ala Gly Thr Pro Gly Pro Gin Glu Gly Thr Ala Arg 
1585 1590 1595 1600 

Ala Glu Ser Gly Leu Phe Ala Ser Phe Arg Ala Ala Val Glu Gin Arg 
1605 1610 1615 

Arg Ser Ser Glu Val Val Glu Leu Met Ala Asp Leu Ala Ala Phe Arg 
1620 1625 1630 

Pro Ala Tyr Ser Arg Gin His Pro Gly Ser Gly Arg Pro Ala Pro Val 
1635 1640 1645 

Pro Leu Ala Thr Gly Pro Ala Thr Arg Pro Thr Leu Tyr Cys Cys Ala 
1650 1655 1660 

Gly Thr Ala Val Gly Ser Gly Pro Ala Glu Tyr Val Pro Phe Ala Glu 
1665 1670 1675 1680 

Gly Leu Arg Gly Val Arg Glu Thr Val Ala Leu Pro Leu Ser Gly Phe 
1685 1690 1695 

Gly Asp Pro Ala Glu Pro Met Pro Ala Ser Leu Asp Ala Leu lie Glu 
1700 1705 1710 

Val Gin Ala Asp Val Leu Leu Glu His Thr Ala Gly Lys Pro Phe Ala 
1715 1720 1725 

Leu Ala Gly His Ser Ala Gly Ala Asn He Ala His Ala Leu Ala Ala 
1730 1735 1740 

Arg Leu Glu Glu Arg Gly Ser Gly Pro Ala Ala Val Val Leu Met Asp 
1745 1750 1755 1760 

Val Tyr Arg Pro Glu Asp Pro Gly Ala Met Gly Glu Trp Arg Asp Asp 
1765 1770 1775 

Leu Leu Ser Trp Ala Leu Glu Arg Ser Thr Val Pro Leu Glu Asp His 
1780 1785 1790 
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15 



20 



Arg Leu Thr Ala Met Ala Gly Tyr Gin Arg Leu Val Leu Gly Thr Arg 
1795 1800 1805 

Leu Thr Ala Leu Glu Ala Pro Val Leu Leu Ala Arg Ala Ser Glu Pro 
1810 1815 1820 

Leu Cys Ala Trp Pro Pro Ala Gly Gly Ala Arg Gly Asp Trp Arg Ser 
1825 1830 1835 1840 

Gin Val Pro Phe Ala Arg Thr Val Ala Asp Val Pro Gly Asn His Phe 
1845 1850 1855 

Thr Met Leu Thr Glu His Ala Arg His Thr Ala Ser Leu Val His Glu 
1860 1865 1870 

Trp Leu Asp Ser Leu Pro His Gin Pro Gly Pro Ala Pro Leu Thr Gly 
1875 1880 1885 

Gly Lys His 
1890 



(2) INFORMATION FOR SEQ ID NO: 13: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 350.. 13987 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

GACCGCTCGG GGAGACCTGA CATATTCGTC GCGAAGTGGT TGTCCGCGCC GCGAGGTACT 60 

40 GAAATCTTCT CCGCTCGCCC AGGACTCCGC GTGCAGGTCA CCGGAGTGCG CGACCGGCCG 120 

GGACGTCGGA GCGCCGACCC TGCGGACCTG GTGCGATGCC GTGTGGTCCC GCATGATCCC 180 

GCGCCGTCTC CGGTGACGAG AATCGGTGGA CAATCTCCGA ACTTGACACA ATTGATTGTC 240 

45 GTTCACCGGC CGTTCCTGTC GCCCGGCAGT TCGCCCGCTG TACGCTCGGG AAGATCAAGA 300 

AAAGGCAGAA AAGCCACGGC GTGGTACGGC GAACATATGA GGGATGCAG GTG TCT 355 

Met Ser 
1 

50 

GGA GAA CTC GCG ATT TCC CGC AGT GAC GAC CGG TCC GAC GCC GTT GCC 403 
Gly Glu Leu Ala lie Ser Arg Ser Asp Asp Arg Ser Asp Ala Val Ala 
5 10 15 

GTG GTC GGA ATG GCG TGC CGG TTT CCC GGC GCC CCG GGA ATT GCC GAA 451 
55 Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He Ala Glu 



(i) 
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20 25 30 

TTC TGG GAA CTG CTG CGC AGC GGA CGC GGT ATG CCC ACC CGT CAG GAC 499 
Phe Trp Glu Leu Leu Arg Ser Gly Arg Gly Met Pro Thr Arg Gin Asp 
35 40 45 50 

GAC GGC ACC TGG CGG GCC GCC CTG GAG GAC CAC GCC GGC TTC GAC GCC 547 
Asp Gly Thr Trp Arg Ala Ala Leu Glu Asp His Ala Gly Phe Asp Ala 
55 60 65 

GGG TTC TTC GGC ATG AAC GCC CGG CAG GCC GCC GCC ACC GAC CCG CAG 595 
Gly Phe Phe Gly Met Asn Ala Arg Gin Ala Ala Ala Thr Asp Pro Gin 
70 75 80 

CAC CGA CTG ATG CTG GAA CTC GGA TGG GAG GCA CTG GAG GAC GCG GGC 643 
*5 His Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp Ala Gly 

85 90 95 

ATC GTC CCC GGC GAT CTC ACC GGC ACC GAC ACC GGA GTC TTC GCC GGC 691 
lie Val Pro Gly Asp Leu Thr Gly Thr Asp Thr Gly Val Phe Ala Gly 
2Q 100 105 110 

GTG GCG TCC GAC GAC TAT GCC GTT CTC ACC CGC CGT TCC GCC GTC TCC 739 
Val Ala Ser Asp Asp Tyr Ala Val Leu Thr Arg Arg Ser Ala Val Ser 
115 120 125 130 

25 GCC GGG GGA TAC ACC GCC ACG GGG CTG CAC CGC GCC CTG GCC GCC AAC 787 

Ala Gly Gly Tyr Thr Ala Thr Gly Leu His Arg Ala Leu Ala Ala Asn 
135 140 145 

CGC CTC TCC CAC TTC CTG GGC CTG CGC GGC CCC AGC CTG GTC GTC GAC 835 
Arg Leu Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val Val Asp 
150 155 160 

TCG GCC CAG TCC GCC TCA CTG GTG GCC GTC CAG CTC GCC TGC GAG AGT 883 
Ser Ala Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys Glu Ser 
165 170 175 

CTG CGC CGG GGT GAG ACG TCG CTC GCC GTC GCG GGC GGT GTC AAC CTC 931 
Leu Arg Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val Asn Leu 
180 185 190 

ATC CTC ACC GAG GAG AGC ACC ACC GTC ATG GAG CGT ATG GGA GCG CTC 979 
He Leu Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly Ala Leu 
195 200 205 210 



30 



35 



40 



TCA CCC GAC GGC CGC TGC CAC ACC TTC GAC GCC CGC GCC AAC GGC TAC 1027 

Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn Gly Tyr 
45 215 220 225 

GTA CGC GGC GAG GGC GGC GGA GCC GTC GTG CTC AAG CCA CTG GAC GCC 1075 

Val Arg Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu Asp Ala 
230 235 240 



50 



55 



GCA CTC GCC GAC GGC GAC CGC GTG TAC TGC GTC ATC AAG GGA GGT GCC 1123 
Ala Leu Ala Asp Gly Asp Arg Val Tyr Cys Val He Lys Gly Gly Ala 
245 250 255 

GTC AAC AAC GAC GGC GGC GGC GCG AGC CTC ACC ACT CCC GAC CGG GAG 1171 
Val Asn Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp Arg Glu 
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260 265 270 

GCG CAG GAA GCT GTG CTG CGC CAG GCC TAC CGG CGG GCG GGC GTC AGC 1219 
Ala Gin Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly Val Ser 
275 280 285 290 

ACC GGC GCC GTC CGC TAC GTC GAG CTG CAC GGG ACC GGC ACC CGG GCC 1267 
Thr Gly Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr Arg Ala 
295 300 305 

GGC GAC CCC GTC GAG GCG GCC GCA CTG GGC GCC GTG CTC GGG GCG GGG 1315 
Gly Asp Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly Ala Gly 
310 315 320 

GCG GAC AGC GGC CGC AGC ACG CCG CTC GCC GTC GGC TCG GTG AAG ACC 1363 
Ala Asp Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val Lys Thr 
325 330 335 

AAC GTC GGC CAT CTG GAG GGC GCG GCG GGC ATC GTC GGA CTG ATC AAG 1411 
Asn Val Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu lie Lys 
340 345 350 

GCC ACG CTG TGC GTA CGG AAG GGC GAA CTC GTC CCC AGC CTC AAC TTC 1459 
Ala Thr Leu Cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu Asn Phe 
355 360 365 370 

2S AGC ACG CCG AAC CCT GAC ATC CCC CTC GAC GAC CTG CGG CTG CGC GTC 15,07 

Ser Thr Pro Asn Pro Asp lie Pro Leu Asp Asp Leu Arg Leu Arg Val 
375 380 385 
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CAG ACC GAA CGG CAG GAG TGG AAC GAG GAG GAC GAC CGG CCG CGC GTG 1555 
Gin Thr Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro Arg Val 
390 395 400 

GCC GGC GTC TCC TCC TTC GGT ATG GGC GGA ACC AAT GTC CAC CTC GTG 1603 
Ala Gly Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His Leu Val 
405 410 415 

ATC GCG GAG GCT CCG GCC GCG GCG GGG TCC TCC GGG GCG GGG GGT TCG 1651 
lie Ala Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly Gly Ser 
420 425 430 

GGC GCT GGT TCC GGT GCC GGT ATC AGC GCT GTT TCT GGT GTG GTG CCG 1699 
Gly Ala Gly Ser Gly Ala Gly He Ser Ala Val Ser Gly Val Val Pro 
435 440 445 450 

GTG GTG GTT TCG GGG CGT TCG CGG GTG GTG GTG CGG GAG GCT GCG GGC 1747 
Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala Ala Gly 
455 460 465 

CGG TTG GCG GAG GTG GTG GAG GCC GGT GGT GTG GGG CTG GCG GAT GTG 1795 
Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala Asp Val 
470 475 480 

GCG GTG ACG ATG GCG GAC CGG TCG CGG TTT GGG TAT CGG GCG GTT GTG 1843 
Ala Val Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala Val Val 
485 490 495 

CTG GCT CGG GGT GAG GCT GAG CTT GCC GGG CGT TTG CGG GCG TTG GCG 1891 
Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala Leu Ala 
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500 505 510 

GGG GGT GAT CCG GAC GCG GGT GTG GTC ACC GGT GCG GTT CTC GAC GGT 1939 
Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu Asp Gly 
515 520 525 530 

GGT GTG GTT GTC GGT GCT GCC CCC GGC GGT GCC GGT GCT GCC GGT GGT 1987 
Gly Val Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala Gly Gly 
535 540 545 

GCC GGT GCT GCC GGT GGT GCC GGT GGT GGG GGC GTG GTG TTG GTT TTC 2035 
Ala Gly Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu Val Phe 
550 555 560 

CCT GGT CAG GGG ACG CAG TGG GTG GGG ATG GGT GCG GGG CTG CTG GGG 2083 
Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu Leu Gly 
565 570 575 

TCT TCG GAG GTG TTT GCG GCG TCG ATG CGG GAG TGT GCG CGG GCG CTG 2131 
Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg Ala Leu 
580 585 590 

AGT GTT CAT GTG GGG TGG GAT TTG CTG GAG GTG GTG TCG GGC GGG GCC 2179 
Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly Gly Ala 
595 600 605 610 

25 GGG TTG GAG CGG GTG GAT GTG GTG CAG CCG GTG ACG TGG GCG GTG ATG 2227 

Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala Val Met 
615 620 625 
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GTG TCG CTG GCC CGG TAC TGG CAG GCG ATG GGT GTG GAC GTG GCT GCG 2275 
Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val Ala Ala 
630 635 640 

GTG GTG GGT CAT TCC CAG GGG GAG ATC GCC GCT GCC ACG GTG GCG GGG 2323 
Val Val Gly His Ser Gin Gly Glu lie Ala Ala Ala Thr Val Ala Gly 
645 650 655 

GCG TTG TCG CTG GAG GAT GCG GCG GCT GTG GTC GCT CTG CGG GCG GGG 2371 
Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg Ala Gly 
660 665 670 

TTG ATT GGC CGG TAT CTG GCG GGT CGT GGT GCG ATG GCG GCT GTT CCG 2419 
Leu lie Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala Val Pro 
675 680 685 690 

CTG CCT GCC GGC GAG GTC GAG GCC GGG CTG GCG AAG TGG CCG GGT GTG 2467 
Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro Gly Val 
695 700 705 

GAG GTC GCG GCG GTC AAC GGT CCG GCG TCT ACG GTG GTT TCC GGG GAT 2515 
Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser Gly Asp 
710 715 720 

CGG CGG GCG GTG GCC GGT TAT GTG GCC GTC TGT CAG GCG GAG GGT GTG 2563 
Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu Gly Val 
725 730 735 

CAG GCC CGG TTG ATA CCG GTG GAC TAC GCC TCT CAC TCC CGC CAT GTG 2611 
Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg His Val 
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740 745 750 

GAG GAC CTG AAG GGC GAG TTG GAG CGG GTG CTG TCC GGT ATC CGC CCC 2659 
Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly lie Arg Pro 
755 760 765 770 

CGC AGT CCG CGG GTG CCG GTG TGT TCC ACC GTC GCC GGA GAG CAG CCG 2707 
Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu Gin Pro 
775 780 785 

GGC GAG CCG GTT TTC GAT GCG GGG TAT TGG TTC CGT AAT CTG CGG AAC 2755 
Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu Arg Asn 
790 795 800 

CGG GTT GAG TTC TCC GCG GTG GTC GGT GGT TTG TTG GAG GAG GGC CAC 2803 
'5 Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu Gly His 

805 810 815 

CGT CGG TTC ATC GAG GTC AGT GCC CAC CCG GTA CTC GTC CAT GCG ATC 2851 
Arg Arg Phe He Glu Val Ser Ala His Pro Val Leu Val His Ala He 
20 820 825 830 

GAG CAG ACG GCC GAG GCC GCG GAC CGG AGT GTC CAT GCC ACC GGG ACC 2899 
Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr Gly Thr 
835 840 845 850 

25 CTG CGC CGC CAG GAC GAC AGC CCG CAC CGC CTG CTG ACC TCC ACC GCC 2947 

Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser Thr Ala 
855 860 865 
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GAG GCC TGG GCC CAC GGC GCC ACC CTC ACC TGG GAC CCC GCC CTG CCC 2995 
Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala Leu Pro 
870 875 880 

CCA GGC CAC CTC ACC ACC CTC CCC ACC TAC CCC TTC AAC CAC CAC CAC 3043 
Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His His His 
885 890 895 

TAC TGG CTC GAC ACC ATT GAC GGG GGC GGA GGG GAC GAC GCG ACC CAG 3091 
Tyr Trp Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala Thr Gin 
900 905 910 

GAG AAG GAG AGC GGC CCT CTG ACG CGG GAA CTG CGT GGG CTG CCG TCC 3139 
Glu Lys Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu Pro Ser 
915 920 925 930 

TCT CAG AAG CAA CTG GGT TTC CTG CTC GAT CTG GTG TGC CGG CAC ACG 3187 
Ser Gin Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg His Thr 
935 940 945 

GCC GTC GTA CTC GGC CTG GAC ACG GCC GCC GAG GTG GAC CCG GAC CTG 3235 
Ala Val Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro Asp Leu 
950 955 960 

TCC TTC AAG AAG CAG GGC ATC CAG TCC ATG ACC GGC GTC GAG CTG CGC 3283 
Ser Phe Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu Leu Arg 
965 970 975 

AAC AGG CTG CTG ACC GAG ACC GGC CTG GCA TTG CCC ACC ACC CTC GTC 3331 
Asn Arg Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr Leu Val 
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980 985 990 

TAG GAC CGG CCC ACC CCT CGC GCC CTG GCG CAG TTC CTC CAC ACC GAG 3379 
Tyr Asp Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His Thr Glu 
995 1000 1005 1010 

TTG CTC GAC GGC TCC CCC TCG GGC TCC GTC CTC GCA CCG GCG CAG AAG 3427 
Leu Leu Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala Gin Lys 
1015 1020 1025 

AGC TTC GAG GCC GGC GGG CCC GGA GTG CTC TCG TCG GCC GCG GTA GGG 3475 
Ser Phe Glu Ala Gly Gly Pro Gly Val Leu Ser Ser Ala Ala Val Gly 
1030 1035 1040 

GTG TCG GAC GCC CGG GGC GGC AGC CGG GAC GAC GAC GAC CCG ATC GCC 3523 
Val Ser Asp Ala Arg Gly Gly Ser Arg Asp Asp Asp Asp Pro lie Ala 
1045 1050 1055 

ATC GTG GGT GTC GGC TGC CGG CTC CCC GGC GGC GTC GAC TCG CGC GCC 3571 
He Val Gly Val Gly Cys Arg Leu Pro Gly Gly Val Asp Ser Arg Ala 
1060 1065 1070 

GCT CTC TGG GAG CTG CTG GAG TCC GGC GCC GAC GCC ATC TCG TCC TTC 3619 
Ala Leu Trp Glu Leu Leu Glu Ser Gly Ala Asp Ala He Ser Ser Phe 
1075 1080 1085 1090 

25 CCC ACC GAC CGC GGC TGG GAC CTC GAC GGG CTG TAC GAC CCC GAG CCC 3667 

Pro Thr Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr Asp Pro Glu Pro 
1095 1100 1105 



W 



15 



20 



30 



35 



GGG ACG CCC GGC AAG ACC TAT GTG CGG GAG GGC GGG TTC CTG CAC TCG 3715 
Gly Thr Pro Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu His Ser 
1110 1115 1120 

GCG GCC GAG TTC GAC GCG GAG TTC TTC GGG ATA TCG CCG CGC GAG GCC 37 63 

Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala 
1125 1130 1135 

ACG GCC ATG GAC CCG CAG CAG CGC TTG CTG CTG GAA GCG TCG TGG GAG 3811 
Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Ala Ser Trp Glu 
1140 1145 1150 

40 GCC CTC GAG GAC GCC GGA GTG CTC CCC GAG TCA CTG CGC GGC GGC GAC 3859 

Ala Leu Glu Asp Ala Gly Val Leu Pro Glu Ser Leu Arg Gly Gly Asp 
1155 1160 1165 1170 

GCC GGA GTG TTC GTC GGC GCC ACC GCA CCG GAG TAC GGG CCG AGG CTT 3907 
Ala Gly Val Phe Val Gly Ala Thr Ala Pro Glu Tyr Gly Pro Arg Leu 
45 1175 1180 1185 

CAC GAG GGA GCG GAC GGA TAC GAG GGG TAC CTG CTC ACC GGC ACC ACC 3955 
His Glu Gly Ala Asp Gly Tyr Glu Gly Tyr Leu Leu Thr Gly Thr Thr 
1190 1195 1200 



so 
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GCG AGC GTG GCC TCC GGC CGG ATC GCC TAC ACC CTC GGC ACC GGC GGA 4003 

Ala Ser Val Ala Ser Gly Arg He Ala Tyr Thr Leu Gly Thr Gly Gly 
1205 1210 1215 

CCG GCG CTC ACC GTC GAC ACC GCG TGC TCC TCG TCC CTG GTG GCG CTG 4051 

Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu 
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1220 1225 1230 

CAC CTG GCC GTG CAG GCG CTG CGC CGG GGC GAG TGC GGG CTG GCT CTG 4099 
His Leu Ala Val Gin Ala Leu Arg Arg Gly Glu Cys Gly Leu Ala Leu 
1235 1240 1245 1250 

GCG GGC GGC GCC ACG GTG ATG TCG GGG CCC GGC ATG TTC GTG GAG TTC 4147 
Ala Gly Gly Ala Thr Val Met Ser Gly Pro Gly Met Phe Val Glu Phe 
1255 1260 1265 

TCG CGG CAG CGC GGG CTC GCC CCC GAC GGC CGC TGC ATG CCG TTC TCC 4195 
Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Met Pro Phe Ser 
1270 1275 1280 

GCC GAT GCC GAC GGT ACG GCC TGG TCC GAG GGT GTC GCC GTA CTG GCA 4243 
Ala Asp Ala Asp Gly Thr Ala Trp Ser Glu Gly Val Ala Val Leu Ala 
1285 1290 1295 

CTG GAG CGG CTC TCC GAC GCC CGG CGT GCG GGA CAC CGG GTG CTG GGC 4291 
Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly His Arg Val Leu Gly 
1300 1305 1310 

GTG GTG CGG GGC AGT GCG GTC AAC CAG GAC GGT GCC AGC AAC GGC CTG 4339 
Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu 
1315 1320 1325 1330 

ACC GCT CCC AAC CGC TCC GCG CAG GAG GGC GTC ATC CGA GCT GCC CTG 4387 
Thr Ala Pro Asn Arg Ser Ala Gin Glu Gly Val lie Arg Ala Ala Leu 
1335 1340 1345 

GCC GAC GCC GGC CTC GCG CCG GGT GAC GTG GAC GCG GTG GAG GCG CAC 443 5 

Ala Asp Ala Gly Leu Ala Pro Gly Asp Val Asp Ala Val Glu Ala His 
1350 1355 1360 

GGT ACG GGG ACG GCG CTG GGC GAT CCG ATC GAG GCG AGC GCG CTG CTG 4483 
Gly Thr Gly Thr Ala Leu Gly Asp Pro lie Glu Ala Ser Ala Leu Leu 
1365 1370 1375 

GCC ACG TAC GGG CGT GAG CGG GTG GGC GAC CCC TTG TGG CTC GGG TCG 4531 
Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser 
1380 1385 1390 

40 CTG AAG TCC AAC GTC GGT CAC ACC CAG GCC GCC GCG GGG GCC GCG GGT 4579 

Leu Lys Ser Asn Val Gly His Thr Gin Ala Ala Ala Gly Ala Ala Gly 
1395 1400 1405 1410 
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GTG GTC AAG ATG CTG CTT GCC CTG GAG CAC GGC ACG CTG CCG CGG ACA 4627 
Val Val Lys Met Leu Leu Ala Leu Glu His Gly Thr Leu Pro Arg Thr 
1415 1420 1425 

CTT CAC GCG GAC CGG CCC AGC ACG CAC GTC GAC TGG TCG TCG GGC ACC 4 675 

Leu His Ala Asp Arg Pro Ser Thr His Val Asp Trp Ser Ser Gly Thr 
1430 1435 1440 

GTC GCC CTG CTG GCA GAG GCG CGC CGG TGG CCC CGG CGG TCG GAC CGC 4723 
Val Ala Leu Leu Ala Glu Ala Arg Arg Trp Pro Arg Arg Ser Asp Arg 
1445 1450 1455 

CCG CGC CGG GCG GCT GTG TCG TCG TTC GGG ATC AGT GGG ACG AAC GCG 4771 
Pro Arg Arg Ala Ala Val Ser Ser Phe Gly lie Ser Gly Thr Asn Ala 
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1460 1465 1470 

CAT CTG ATC ATC GAG GAG GCG CCG GAG TGG GTC GAG GAC ATC GAC GGC 4819 
His Leu He He Glu Glu Ala Pro Glu Trp Val Glu Asp He Asp Gly 
1475 1480 1485 1490 

GTC GCT GCT CCT GAC CGC GGT ACC GCG GAC GCG GCT GCT CCG TCG CCG 4867 
Val Ala Ala Pro Asp Arg Gly Thr Ala Asp Ala Ala Ala Pro Ser Pro 
1495 1500 1505 

CTG TTG TTG TCC GCG CGG TCG GAG GGG GCG TTG CGG GCG CAG GCG GTG 4915 
Leu Leu Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val 
1510 1515 1520 

CGG TTG GGT GAG TAC GTG GAG CGG GTG GGT GCG GAT CCG CGG GAT GTG 4963 
Arg Leu Gly Glu Tyr Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val 
1525 1530 1535 

GCT TAT TCG CTG GCT TCG ACG CGG ACT CTT TTC GAG CAC CGT GCG GTG 5011 
Ala Tyr Ser Leu Ala Ser Thr Arg Thr Leu Phe Glu His Arg Ala Val 
1540 1545 1550 

GTG CCG TGT GGT GGG CGT GGG GAG CTC GTC GCT GCT CTT GGT GGG TTT 5059 
Val Pro Cys Gly Gly Arg Gly Glu Leu Val Ala Ala Leu Gly Gly Phe 
1555 1560 1565 1570 

GCT GCC GGG AGG GTG TCT GGG GGT GTG CGG TCC GGG CGG GCT GTG CCG 5107 
Ala Ala Gly Arg Val Ser Gly Gly Val Arg Ser Gly Arg Ala Val Pro 
1575 1580 1585 

GGT GGG GTG GGG GTG TTG TTC ACG GGT CAG GGT GCG CAG TGG GTT GGT 5155 
Gly Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp Val Gly 
1590 1595 1600 

ATG GGG CGT GGG TTG TAT GCG GGG GGT GGG GTG TTT GCG GAG GTG CTG 5203 
Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu Val Leu 
1605 1610 1615 

GAT GAG GTG TTG TCG ATG GTG GGG GAG GTG GAT GGT CGG TCG TTG CGG 5251 
Asp Glu Val Leu Ser Met Val Gly Glu Val Asp Gly Arg Ser Leu Arg 
1620 1625 1630 

GAT GTG ATG TTC GGC GAC GTC GAC GTG GAC GCG GGT GCC GGG GCT GAT 5299 
Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly Ala Asp 
1635 1640 1645 1650 

GCG GGT GCC GGT GCG GGT GCT GGG GTC GGT TCT GGT TCC GGT TCT GTG 5347 
Ala Gly Ala Gly Ala Gly Ala Gly Val Gly Ser Gly Ser Gly Ser Val 
1655 1660 1665 

GGT GGG TTG TTG GGT CGG ACG GAG TTT GCT CAG CCT GCG CTG TTT GCG 5395 
Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala 
1670 1675 1680 

TTG GAG GTG GCG TTG TTC CGG GCG TTG GAG GCT CGG GGT GTG GAG GTG 5443 
Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val 
1685 1690 1695 

TCG GTG GTG TTG GGT CAT TCG GTG GGG GAG GTG GCT GCT GCG TAT GTG 5491 
Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr Val 
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1700 1705 1710 

GCG GGG GTG TTG TCG TTG GGT GAT GCG GTG CGG TTG GTG GTG GCG CGG 5539 
Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg 
1715 1720 1725 1730 

GGT GGG TTG ATG GGT GGG TTG CCG GTG GGT GGG GGG ATG TGG TCG GTG 5587 
Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val 
1735 1740 1745 

GGG GCG TCG GAG TCG GTG GTG CGG GGG GTT GTT GAG GGG TTG GGG GAG 5635 
Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly Glu 
1750 1755 1760 

TGG GTG TCG GTT GCG GCG GTG AAT GGG CCG CGG TCG GTG GTG TTG TCG 5683 
Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser 
1765 1770 1775 

GGT GAT GTG GGT GTG CTG GAG TCG GTG GTT GCC TCG CTG ATG GGG GAT 5731 
Gly Asp Val Gly Val Leu Glu Ser Val Val Ala Ser Leu Met Gly Asp 
1780 1785 1790 

GGG GTG GAG TGC CGG CGG TTG GAT GTG TCG CAT GGG TTT CAT TCG GTG 5779 
Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser Val 
1795 1800 1805 1810 

25 TTG ATG GAG CCG GTG TTG GGG GAG TTC CGG GGG GTT GTG GAG TCG TTG 5827 

Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu 
1815 1820 1825 
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GAG TTC GGT CGG GTG CGG CCG GGT GTG GTG GTG GTG TCG GGT GTG TCG 5875 
Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val Ser 
1830 1835 1840 

GGT GGG GTG GTG GGT TCG GGG GAG TTG GGG GAT CCG GGG TAT TGG GTG 5923 
Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp Val 
1845 1850 1855 

CGT CAT GCG CGG GAG GCG GTG CGT TTC GCG GAT GGG GTG GGG GTG GTG 5971 
Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val Val 
I860 1865 1870 

40 CGT °GT CTG GGT GTG GGG ACG TTG GTG GAG GTG GGT CCG CAT GGG GTG 6019 

Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly Val 
1875 1880 1885 1890 

CTG ACG GGG ATG GCG GGT GAG TGC CTG GGG GCC GGT GAT GAT GTG GTG 6067 
Leu Thr Gly Met Ala Gly Glu Cys Leu Gly Ala Gly Asp Asp Val Val 
45 1895 1900 1905 

GTG GTG CCG GCG ATG CGG CGG GGC CGT GCG GAG CGG GAG GTG TTC GAG 6115 
Val Val Pro Ala Met Arg Arg Gly Arg Ala Glu Arg Glu Val Phe Glu 
1910 1915 1920 



50 



55 



GCG GCG CTG GCG ACG GTG TTC ACC CGG GAC GCC GGC CTG GAC GCC ACG 6163 
Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr 
1925 1930 1935 

GCA CTC CAC ACC GGG AGC ACC GGC CGG CGC ATC GAC CTC CCC ACC TAC 6211 
Ala Leu His Thr Gly Ser Thr Gly Arg Arg He Asp Leu Pro Thr Tyr 
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1940 1945 1950 

CCC TTC CAA CGC GAC CGC TAC TGG CTG GAC CCC GTT CGC ACC GCC GTG 6259 
Pro Phe Gin Arg Asp Arg Tyr Trp Leu Asp Pro Val Arg Thr Ala Val 
1955 1960 1965 1970 

ACC GGC GTC GAG CCC GCC GGC TCG CCG GCG GAC GCT CGG GCC ACT GAG 6307 
Thr Gly Val Glu Pro Ala Gly Ser Pro Ala Asp Ala Arg Ala Thr Glu 
1975 1980 1985 

CGG GGA CGG TCG ACG ACG GCC GGG ATC CGC TAC CGC GTC GCT TGG CAG 6355 
Arg Gly Arg Ser Thr Thr Ala Gly lie Arg Tyr Arg Val Ala Trp Gin 
1990 1995 2000 

CCG GCC GTC GTC GAC CGC GGC AAC CCC GGG CCT GCC GGT CAT GTG CTG 6403 
Pro Ala Val Val Asp Arg Gly Asn Pro Gly Pro Ala Gly His Val Leu 
2005 2010 2015 

CTT CTG GCC CCG GAC GAG GAC ACG GCC GAC TCC GGA CTC GCC CCC GCG 6451 
Leu Leu Ala Pro Asp Glu Asp Thr Ala Asp Ser Gly Leu Ala Pro Ala 
2020 2025 2030 

ATC GCA CGT GAA CTC GCC GTG CGC GGG GCC GAG GTC CAC ACC GTC GCC 6499 
He Ala Arg Glu Leu Ala Val Arg Gly Ala Glu Val His Thr Val Ala 
2035 2040 2045 2050 

25 GTG CCG GTC GGT ACA GGC CGG GAG GCA GCC GGG GAC CTG TTG CGG GCC 6547 

Val Pro Val Gly Thr Gly Arg Glu Ala Ala Gly Asp Leu Leu Arg Ala 
2055 2060 2065 
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GCC GGT GAC GGT GCC GCC CGC AGC ACC CGA GTT CTG TGG CTC GCC CCG 6595 
Ala Gly Asp Gly Ala Ala Arg Ser Thr Arg Val Leu Trp Leu Ala Pro 
2070 2075 2080 

GCC GAG CCG GAC GCG GCC GAC GCC GTC GCC CTC GTC CAG GCG CTG GGC 6643 
Ala Glu Pro Asp Ala Ala Asp Ala Val Ala Leu Val Gin Ala Leu Gly 
2085 2090 2095 

GAG GCG GTA CCC GAA GCC CCG CTC TGG ATC ACC ACC CGT GAG GCG GCG 6691 
Glu Ala Val Pro Glu Ala Pro Leu Trp He Thr Thr Arg Glu Ala Ala 
2100 2105 2110 

GCC GTG CGG CCG GAC GAG ACC CCT TCC GTC GGG GGC GCT CAG CTG TGG 6739 
Ala Val Arg Pro Asp Glu Thr Pro Ser Val Gly Gly Ala Gin Leu Trp 
2H5 2120 2125 2130 



GGA CTC GGA CAG GTC GCC GCG CTC GAA CTG GGG CGG CGC TGG GGC GGC 6787 
Gly Leu Gly Gin Val Ala Ala Leu Glu Leu Gly Arg Arg Trp Gly Gly 
45 2135 2140 2145 

TTG GCG GAC CTG CCC GGG AGT GCG TCG CCC GCG GTG CTC CGT ACG TTC 6835 
Leu Ala Asp Leu Pro Gly Ser Ala Ser Pro Ala Val Leu Arg Thr Phe 
2150 2155 2160 



GTC GGG GCG CTG CTC GCC GGG GGA GAG AAC CAG TTC GCG GTA CGG CCC 6883 
Val Gly Ala Leu Leu Ala Gly Gly Glu Asn Gin Phe Ala Val Arg Pro 
2165 2170 2175 

TCC GGC GTC CAT GTC CGC CGT GTG GTT CCC GCG CCC GTC CCC GTC CCG 6931 
Ser Gly Val His Val Arg Arg Val Val Pro Ala Pro Val Pro Val Pro 
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2180 2185 2190 

GCC TCC GCT CGC ACC GTC ACC ACG GCC CCC GCC ACC GCC GTC GGC GAG 6979 
Ala Ser Ala Arg Thr Val Thr Thr Ala Pro Ala Thr Ala Val Gly Glu 
2195 2200 2205 2210 

GAC GCA CGG AAC GAC ACC TCG GAC GTG GTC GTG CCG GAC GAC CGG TGG 7027 
Asp Ala Arg Asn Asp Thr Ser Asp Val Val Val Pro Asp Asp Arg Trp 
2215 2220 2225 

TCC TCC GGC ACC GTA CTG ATC ACC GGG GGC ACC GGT GCC CTG GGT GCG 7075 
Ser Ser Gly Thr Val Leu lie Thr Gly Gly Thr Gly Ala Leu Gly Ala 
2230 2235 2240 

CAG GTC GCC CGC AGG CTC GCC CGG TCG GGC GCC GCG CGT CTG CTC CTG 7123 
Gin Val Ala Arg Arg Leu Ala Arg Ser Gly Ala Ala Arg Leu Leu Leu 
2245 2250 2255 

GTG GGC CGG CGC GGC GCG GCC GGC CCC GGA GTG GGC GAA CTC GTC GAG 7171 
Val Gly Arg Arg Gly Ala Ala Gly Pro Gly Val Gly Glu Leu Val Glu 
2260 2265 2270 

GAG CTG ACG GCG CTC GGT TCC GAA GTG GCC GTC GAG GCC TGC GAC GTC 7219 
Glu Leu Thr Ala Leu Gly Ser Glu Val Ala Val Glu Ala Cys Asp Val 
2275 2280 2285 2290 

25 GCC GAC CGG GAC GCA CTG GCC GCG CTC CTC GCG GGC CTC CCC GAG GAG 7267 

Ala Asp Arg Asp Ala Leu Ala Ala Leu Leu Ala Gly Leu Pro Glu Glu 
2295 2300 2305 
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CGG CCC CTC GTC GCC GTA CTG CAC GCG GCA GGT GTG CTC GAC GAC GGT 7315 
Arg Pro Leu Val Ala Val Leu His Ala Ala Gly Val Leu Asp Asp Gly 
2310 2315 2320 

GTG CTC GAC TCG CTC ACC TCC GAC CGG GTG GAC GCC GTA CTG CGG GAC 7363 
Val Leu Asp Ser Leu Thr Ser Asp Arg Val Asp Ala Val Leu Arg Asp 
2325 2330 2335 

AAG GTC ACC GCC GCC CGT CAC CTG GAC GAG CTG ACC GCG GAC CTT CCG 7411 
Lys Val Thr Ala Ala Arg His Leu Asp Glu Leu Thr Ala Asp Leu Pro 
2340 2345 2350 

CTC GAC GCC TTC GTG CTC TTC TCC TCC ATC GTC GGC GTG TGG GGC AAC 7459 
Leu Asp Ala Phe Val Leu Phe Ser Ser lie Val Gly Val Trp Gly Asn 
2355 2360 2365 2370 



GGA GGG CAG GCC GTC TAC GCG GCC GCC AAC GCC GCG CTC GAC GCC CTG 7507 
Gly Gly Gin Ala Val Tyr Ala Ala Ala Asn Ala Ala Leu Asp Ala Leu 
45 2375 2380 2385 

GCG CAG CGG CGC CGG GCC AGG GGA GCC CGT GCC GCC TCG ATC GCC TGG 7555 
Ala Gin Arg Arg Arg Ala Arg Gly Ala Arg Ala Ala Ser He Ala Trp 
2390 2395 2400 



GGG CCG TGG GCC GGT GCC GGA ATG GCC TCC GGA ACG GCG GCG AAG TCC 7 603 

Gly Pro Trp Ala Gly Ala Gly Met Ala Ser Gly Thr Ala Ala Lys Ser 
2405 2410 2415 

TTC GAA CGG GAC GGC GTC ACG GCC CTG GAC CCC GAG CGC GCG CTC GAC 7651 
Phe Glu Arg Asp Gly Val Thr Ala Leu Asp Pro Glu Arg Ala Leu Asp 
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2420 2425 2430 

GTC CTC GAC GAC GTG GTG GGC GCC GGC GGG ACC TCT GCC GCA GGG ACG 7699 
Val Leu Asp Asp Val Val Gly Ala Gly Gly Thr Ser Ala Ala Gly Thr 
2435 2440 2445 2450 

CAC GCG GCC GGC GAG AGC TCC CTG CTC GTC GCC GAC GTG GAC TGG GAG 7747 
His Ala Ala Gly Glu Ser Ser Leu Leu Val Ala Asp Val Asp Trp Glu 
2455 2460 2465 

ACC TTC GTC GGG CGT TCG GTC ACC CGC CGT ACC TGG TCG CTC TTC GAC 7795 
Thr Phe Val Gly Arg Ser Val Thr Arg Arg Thr Trp Ser Leu Phe Asp 
2470 2475 2480 

GGC GTC TCC GCC GCC CGT TCG GCG CGT GCC GGC CAT GCC GCG GAC GAC 7843 
Gly Val Ser Ala Ala Arg Ser Ala Arg Ala Gly His Ala Ala Asp Asp 
2485 2490 2495 

CGT GCC GCT CTC ACC CCA GGG ACG CGG CCG GGC GAC GGC GCA CCG GGC 7891 
Arg Ala Ala Leu Thr Pro Gly Thr Arg Pro Gly Asp Gly Ala Pro Gly 
2500 2505 2510 

GGG AGC GGA CAG GAC GGG GGC GAG GGC CGG CCG TGG CTC TCC GTC GGC 7939 
Gly Ser Gly Gin Asp Gly Gly Glu Gly Arg Pro Trp Leu Ser Val Gly 
2515 2520 2525 2530 

25 CCC TCG CCG GCG GAA CGC CGT CGT GCT CTG CTC ACG CTT GTG CGC TCG 7987 

Pro Ser Pro Ala Glu Arg Arg Arg Ala Leu Leu Thr Leu Val Arg Ser 
2535 2540 2545 
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GAG GCC GCC GGG ATC CTG CGC CAC GCC TCG GCC GAC GCG GTC GAC CCG 803 5 

Glu Ala Ala Gly He Leu Arg His Ala Ser Ala Asp Ala Val Asp Pro 
2550 2555 2560 

GAG CTG GCC TTC CGG TCC GCC GGG TTC GAC TCC CTC ACC GTT CTC GAA 8083 
Glu Leu Ala Phe Arg Ser Ala Gly Phe Asp Ser Leu Thr Val Leu Glu 
2565 2570 2575 

CTG CGT AAC CGC CTG ACC GCT GCC ACC GGC CTG AAC CTG CCG AAC ACG 8131 
Leu Arg Asn Arg Leu Thr Ala Ala Thr Gly Leu Asn Leu Pro Asn Thr 
2580 2585 2590 

CTG CTC TTC GAC CAC CCG ACC CCC CTC TCG CTC GCC TCC CAC CTG CAC 8179 
Leu Leu Phe Asp His Pro Thr Pro Leu Ser Leu Ala Ser His Leu His 
2595 2600 2605 2610 



GAC GAA CTG TTC GGT CCC GAC AGC GAG GCG GAG CCG GCA GCG GCC GCC 8227 
Asp Glu Leu Phe Gly Pro Asp Ser Glu Ala Glu Pro Ala Ala Ala Ala 
45 2615 2620 2625 

CCC ACG CCG GTC ATG GCC GAC GAG CGT GAG CCG ATC GCG ATC GTG GGC 8275 
Pro Thr Pro Val Met Ala Asp Glu Arg Glu Pro He Ala He Val Gly 
2630 2635 2640 



ATG GCG TGC CGT TAC CCG GGC GGT GTG GCG TCG CCG GAC GAC CTG TGG 8323 

Met Ala Cys Arg Tyr Pro Gly Gly Val Ala Ser Pro Asp Asp Leu Trp 

2645 2650 2655 

GAC CTG GTG GCC GGT GAC GGG CAC ACG CTC TCC CCG TTC CCG GCC GAC 8371 

Asp Leu Val Ala Gly Asp Gly His Thr Leu Ser Pro Phe Pro Ala Asp 
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2660 



2665 



2670 



CGT GGC TGG GAC GTC GAG GGG CTG TAC GAC CCG GAG CCG GGG GTG CCG 
Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly Val Pro 
2675 2680 2685 2690 



8419 



JO 



GGC AAG AGC TAT GTA CGG GAA GGC GGG TTC CTG CGT TCC GCG GCC GAG 8467 
Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu Arg Ser Ala Ala Glu 
2695 2700 2705 

TTC GAC GCG GAG TTC TTC GGG ATA TCG CCG CGC GAG GCC ACG GCC ATG 8515 
Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Thr Ala Met 
2710 2715 2720 



15 



GAC CCG CAG CAG CGG TTG CTG CTG GAG ACG TCG TGG GAG GCG CTG GAG 
Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala Leu Glu 
2725 2730 2735 



8563 



20 



CGG GCC GGC ATC GTT CCG GAC TCG CTG CGC GGC ACC CGG ACC GGT GTC 8611 
Arg Ala Gly He Val Pro Asp Ser Leu Arg Gly Thr Arg Thr Gly Val 
2740 2745 2750 

TTC AGC GGC ATC TCC CAG CAG GAC TAC GCG ACC CAG CTG GGG GAC GCC 8659 
Phe Ser Gly He Ser Gin Gin Asp Tyr Ala Thr Gin Leu Gly Asp Ala 
2755 2760 2765 2770 



25 GCC GAC ACC TAC GGC GGG CAT GTG CTC ACG GGG ACC CTC GGC AGT GTG 

Ala Asp Thr Tyr Gly Gly His Val Leu Thr Gly Thr Leu Gly Ser Val 
2775 2780 2785 



30 



ATC TCC GGT CGG GTT GCC TAT GCG TTG GGG TTG GAG GGG CCG GCG CTG 
He Ser Gly Arg Val Ala Tyr Ala Leu Gly Leu Glu Gly Pro Ala Leu 
2790 2795 2800 



8707 



8755 



35 



ACG GTG GAC ACG GCG TGT TCG TCG TCG TTG GTG GCG TTG CAT CTG GCG 

Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His Leu Ala 
2805 2810 2815 

GTG CAG TCG TTG CGG CGG GGT GAG TGT GAT CTG GCG TTG GCC GGT GGG 

Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Leu Ala Leu Ala Gly Gly 
2820 2825 2830 
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GTG ACG GTG ATG GCG ACG CCG ACG GTG TTC GTG GAG TTC TCG CGG CAG 8899 
Val Thr Val Met Ala Thr Pro Thr Val Phe Val Glu Phe Ser Arg Gin 
2835 2840 2845 2850 

CGG GGG CTG GCG GCG GAC GGG CGG TGC AAG GCG TTC GCG GAG GGT GCG 8947 
Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Glu Gly Ala 
2855 2860 2865 

GAC GGG ACG GCG TGG GCG GAG GGT GTG GGT GTG CTG CTG GTG GAG CGG 8995 
Asp Gly Thr Ala Trp Ala Glu Gly Val Gly Val Leu Leu Val Glu Arg 
2870 2875 2880 

CTT TCC GAC GCG CGC CGC AAC GGT CAT CGG GTG CTG GCG GTG GTG CGG 9043 
Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val Val Arg 
2885 2890 2895 

GGC AGT GCG GTC AAT CAG GAC GGT GCG AGC AAT GGG CTG ACG GCG CCG 9091 
Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr Ala Pro 
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2900 



2905 



2910 



AGT GGT CCG GCG CAG CAG CGG GTG ATC CGT GAG GCG CTG GCT GAT GCG 
Ser Gly Pro Ala Gin Gin Arg Val He Arg Glu Ala Leu Ala Asp Ala 
2915 2920 2925 2930 



9139 



10 



GGG CTG GTG CCC GCC GAC GTG GAT GTG GTG GAG GCG CAC GGT ACG GGG 9187 
Gly Leu Val Pro Ala Asp Val Asp Val Val Glu Ala His Gly Thr Gly 
2935 2940 2945 

ACG GCG CTG GGT GAT CCG ATC GAG GCG GGT GCG CTG CTG GCC ACG TAC 9235 
Thr Ala Leu Gly Asp Pro He Glu Ala Gly Ala Leu Leu Ala Thr Tyr 
2950 2955 2960 
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GGG CGG GAG CGG GTC GGC GAT CCG TTG TGG CTC GGG TCG TTG AAG TCG 
Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser Leu Lys Ser 
2965 2970 2975 



9283 



AAC ATC GGG CAT GCG CAG GCG GCT GCG GGT GTG GGT GGT GTG ATC AAG 9331 
Asn He Gly His Ala Gin Ala Ala Ala Gly Val Gly Gly Val He Lys 
2980 2985 2990 

GTG GTG CAG GGG ATG CGG CAT GGG TCG TTG CCG CGG ACG CTG CAT GTG 9379 
Val Val Gin Gly Met Arg His Gly Ser Leu Pro Arg Thr Leu His Val 
2995 3000 3005 3010 



25 GAT GCG CCG TCG TCG AAG GTG GAG TGG GCT TCG GGT GCG GTG GAG CTC 

Asp Ala Pro Ser Ser Lys Val Glu Trp Ala Ser Gly Ala Val Glu Leu 
3015 3020 3025 



9427 



CTG ACC GAG ACC CGG TCG TGG CCG CGG CGG GTG GAG CGG GTG CGG CGG 9475 
Leu Thr Glu Thr Arg Ser Trp Pro Arg Arg Val Glu Arg Val Arg Arg 
3030 3035 3040 

GCC GCG GTG TCG GCG TTC GGG GTG AGC GGG ACC AAC GCC CAT GTG GTC 9523 
Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His Val Val 
3045 3050 3055 

CTG GAG GAA GCG CCG GCG GAG GCC GGG AGC GAG CAC GGG GAC GGC CCT 9571 
Leu Glu Glu Ala Pro Ala Glu Ala Gly Ser Glu His Gly Asp Gly Pro 
3060 3065 3070 

GAA CCT GAG CGG CCC GAC GCG GTG ACG GGT CCG TTG TCG TGG GTG CTT 9619 
Glu Pro Glu Arg Pro Asp Ala Val Thr Gly Pro Leu Ser Trp Val Leu 
307 5 3080 3085 3090 

TCT GCG CGG TCG GAG GGG GCG TTG CGG GCG CAG GCG GTG CGG TTG CGT 9667 
Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val Arg Leu Arg 
3095 3100 3105 

GAG TGT GTG GAG CGG GTG GGT GCG GAT CCG CGG GAT GTG GCG GGG TCG 9715 
Glu Cys Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val Ala Gly Ser 
3110 3115 3120 

TTG GTG GTG TCG CGT GCG TCG TTC GGT GAG CGT GCG GTG GTG GTG GGC 9763 
Leu Val Val Ser Arg Ala Ser Phe Gly Glu Arg Ala Val Val Val Gly 
3125 3130 3135 

CGG GGG CGT GAG GAG TTG CTG GCG GGT CTG GAT GTG GTG GCT GCC GGG 9811 
Arg Gly Arg Glu Glu Leu Leu Ala Gly Leu Asp Val Val Ala Ala Gly 
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3140 



3145 



3150 



GCT CCT GTG GGT GTG TCT TCG GGG GCC GGT GCT GTG GTG CGG GGG AGT 
Ala Pro Val Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg Gly Ser 
3155 31S0 3165 3170 



9859 



10 



GCG GTG CGG GGT CGT GGG GTG GGG GTG TTG TTC ACG GGT CAG GGT GCG 9907 
Ala Val Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala 
3175 3180 3185 

CAG TGG GTT GGT ATG GGG CGT GGG TTG TAT GCG GGG GGT GGG GTG TTT 9955 
Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe 
3190 3195 3200 



is 



GCG GAG GTG CTG GAT GAG GTG TTG TCG GTG GTG GGG GAG GTG GAT GGT 
Ala Glu Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val Asp Gly 
3205 3210 3215 



10003 



20 



CGG TCG TTG CGG GAT GTG ATG TTC GCG GAT GCT GAC TCG GTT TTG GGT 10051 
Arg Ser Leu Arg Asp Val Met Phe Ala Asp Ala Asp Ser Val Leu Gly 
3220 3225 3230 

GGG TTG TTG GGT CGG ACG GAG TTT GCT CAG CCT GCG TTG TTT GCG TTG 10099 
Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe Ala Leu 
3235 3240 3245 3250 



25 GAG GTG GCG TTG TTC CGG GCG TTG GAG GCT CGG GGT GTG GAG GTG TCG 

Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu Val Ser 
3255 3260 3265 



10147 



30 



GTG GTG TTG GGT CAT TCG GTG GGG GAG GTG GCT GCT GCG TAT GTG GCG 
Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr Val Ala 
3270 3275 3280 



10195 



35 



GGG GTG TTG TCG TTG GGT GAT GCG GTG CGG TTG GTG GTG GCG CGG GGT 10243 
Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala Arg Gly 
3285 3290 3295 

GGG TTG ATG GGT GGG TTG CCG GTG GGT GGG GGG ATG TGG TCG GTG GGG 10291 
Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser Val Gly 
3300 3305 3310 



40 



45 



GCG TCG GAG TCG GTG GTG CGG GGG GTT GTT GAG GGG TTG GGG GAG TGG 10339 
Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly Glu Trp 
3315 3320 3325 3330 

GTG TCG GTT GCG GCG GTG AAT GGG CCG CGG TCG GTG GTG TTG TCG GGT 10387 
Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu Ser Gly 
3335 3340 3345 



so 



GAT GTG GGT GTG CTG GAG TCG GTG GTT GTC ACG CTG ATG GGG GAT GGG 10435 
Asp Val Gly Val Leu Glu Ser Val Val Val Thr Leu Met Gly Asp Gly 
3350 3355 3360 

GTG GAG TGC CGG CGG TTG GAT GTG TCG CAT GGG TTT CAT TCG GTG TTG 10483 
Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His Ser Val Leu 
3365 3370 3375 



55 



ATG GAG CCG GTG TTG GGG GAG TTC CGG GGG GTT GTG GAG TCG TTG GAG 
Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser Leu Glu 



10531 
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3380 3385 3390 

TTC GGT CGG GTG CGG CCG GGT GTG GTG GTG GTG TCG GGT GTG TCG GGT 10579 
Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val Ser Gly 
3395 3400 3405 3410 

10627 



10675 



GGG GTG 
Gly Val 


GTG 
Val 


GGT TCG GGG 
Gly Ser Gly 
3415 


GAG 
Glu 


TTG 
Leu 


GGG GAT CCG 
Gly Asp Pro 
3420 


GGG TAT 
Gly Tyr 


TGG GTG CGT 
Trp Val Arg 
3425 


CAT 
His 


GCG 
Ala 


CGG 
Arg 


GAG GCG 
Glu Ala 
3430 


GTG 
Val 


CGT 
Arg 


TTC 
Phe 


GCG GAT GGG 
Ala Asp Gly 
3435 


GTG 
Val 


GGG 
Gly 


GTG GTG 
Val Val 
3440 


CGT 
Arg 


GGT 
Gly 


CTG 
Leu 


GGT GTG GGG 
Gly Val Gly 
3445 


ACG 
Thr 


TTG 
Leu 


GTG GAG GTG GGT 
Val Glu Val Gly 
3450 


CCG 
Pro 


CAT GGG 
His Gly 
3455 


GTG 
Val 


CTG 
Leu 


ACG GGG ATG 
Thr Gly Met 
3460 


GCG GGT 
Ala Gly 


CAG 
Gin 


TGC CTG 
Cys Leu 
3465 


GAG GCC GGT 
Glu Ala Gly 


GAT GAT 
Asp Asp 
3470 


GTG 
Val 


GTG 
Val 


GTG 
Val 


GTG 


CCG 


GCG 


ATG CGG 


CGG 


GGC 


CGT 


CCG GAG CGG 


GAG 


GTG 


TTC 


GAG 


GCG 



10723 



10771 



10819 

Val Pro Ala Met Arg Arg Gly Arg Pro Glu Arg Glu Val Phe Glu Ala 
3475 3480 3485 3490 

25 GCG CTG GCG ACG GTG TTC ACC CGG GAC GCC GGC CTC GAC GCC ACG ACA 10867 

Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala Thr Thr 
3495 3500 3505 



CTC CAC ACC GGG AGC ACC GGC CGA CGC ATC GAC CTC CCC ACC TAC CCC 10915 
Leu His Thr Gly Ser Thr Gly Arg Arg lie Asp Leu Pro Thr Tyr Pro 
3510 3515 3520 

TTC CAA CAC AAC CGC TAC TGG GCA ACC GGC TCA GTG ACC GGT GCG ACC 10963 
Phe Gin His Asn Arg Tyr Trp Ala Thr Gly Ser Val Thr Gly Ala Thr 
3525 3530 3535 

GGC ACC TCG GCA GCC GCG CGC TTC GGC CTG GAG TGG AAG GAC CAC CCC 11011 
Gly Thr Ser Ala Ala Ala Arg Phe Gly Leu Glu Trp Lys Asp His Pro 
3540 3545 3550 

TTC CTC AGC GGC GCC ACG CCG ATA GCC GGC TCC GGC GCG CTG CTC CTC 11059 
Phe Leu Ser Gly Ala Thr Pro lie Ala Gly Ser Gly Ala Leu Leu Leu 
3555 3560 3565 3570 



ACC GGC AGG GTG GGG CTC GCT GCC CAC CCG TGG CTG GCC GAC CAC GCC 11107 
Thr Gly Arg Val Gly Leu Ala Ala His Pro Trp Leu Ala Asp His Ala 
45 3575 3580 3585 

ATC TCC GGC ACG GTG CTG CTC CCC GGA ACG GCG ATC GCC GAC CTG CTG 11155 
He Ser Gly Thr Val Leu Leu Pro Gly Thr Ala He Ala Asp Leu Leu 
3590 3595 3600 



CTG CGG GCG GTC GAG GAG GTC GGC GCC GGA GGG GTC GAG GAA CTG ACG 11203 
Leu Arg Ala Val Glu Glu Val Gly Ala Gly Gly Val Glu Glu Leu Thr 
3605 3610 3615 

CTC CAT GAG CCC CTG CTC CTC CCC GAG CGA GGC GGC CTG CAC GTC CAG 11251 
Leu His Glu Pro Leu Leu Leu Pro Glu Arg Gly Gly Leu His Val Gin 
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3620 



3625 



3630 



GTG CTG GTC GAG GCG GCC GAC GAG CAG GGA CGG CGT GCC GTG GCA GTC 
Val Leu Val Glu Ala Ala Asp Glu Gin Gly Arg Arg Ala Val Ala Val 
3635 3640 3645 3650 



11299 



10 



GCC GCA CGC CCG GAG GGC CCT GGG CGG GAC GGT GAG GAA CAG GAG TGG 11347 
Ala Ala Arg Pro Glu Gly Pro Gly Arg Asp Gly Glu Glu Gin Glu Trp 
3655 3660 3665 

ACC CGG CAC GCG GAA GGC GTG CTC ACC TCC ACC GAG ACG GCC GTT CCG 11395 
Thr Arg His Ala Glu Gly Val Leu Thr Ser Thr Glu Thr Ala Val Pro 
3670 3675 3680 



15 



GAC ATG GGC TGG GCC GCC GGG GCC TGG CCG CCG CCC GGT GCC GAG CCG 
Asp Met Gly Trp Ala Ala Gly Ala Trp Pro Pro Pro Gly Ala Glu Pro 
3685 3690 3695 



11443 



20 



ATC GAC GTC GAG GAG CTG TAC GAC GCG TTC GCC GCG GAC GGC TAC GGC 11491 
lie Asp Val Glu Glu Leu Tyr Asp Ala Phe Ala Ala Asp Gly Tyr Gly 
3700 3705 3710 

TAC GGC CCG GCC TTC ACC GCA CTG TCC GGC GTG TGG CGT CTC GGC GAC 11539 
Tyr Gly Pro Ala Phe Thr Ala Leu Ser Gly Val Trp Arg Leu Gly Asp 
3715 3720 3725 3730 



25 GAA CTC TTC GCC GAG GTG CGG CGG CCC GCG GGG GGC GCG GGC ACG ACC 

Glu Leu Phe Ala Glu Val Arg Arg Pro Ala Gly Gly Ala Gly Thr Thr 
3735 3740 3745 



11587 



30 



GGT GAC GGT TTC GGC GTC CAC CCC GCA CTC TTC GAT GCG GCC CTC CAC 
Gly Asp Gly Phe Gly Val His Pro Ala Leu Phe Asp Ala Ala Leu His 
3750 3755 3760 



11635 



35 



CCG TGG CGC GCC GGC GGG CTG CTG CCC GAC ACG GGC GGC ACC ACC TGG 11683 

Pro Trp Arg Ala Gly Gly Leu Leu Pro Asp Thr Gly Gly Thr Thr Trp 
3765 3770 3775 

GCG CCG TTC TCC TGG CAG GGC ATC GCG CTC CAC ACC ACC GGA GCC GAG 11731 

Ala Pro Phe Ser Trp Gin Gly lie Ala Leu His Thr Thr Gly Ala Glu 
3780 3785 3790 



40 



ACG CTC CGC GTC AGA CTG GCC CCT GCG GCC GGC GGC ACC GAG TCG GCC 
Thr Leu Arg Val Arg Leu Ala Pro Ala Ala Gly Gly Thr Glu Ser Ala 
3795 3800 3805 3810 



11779 



45 



TTC TCC GTA CAG GCC GCC GAC CCG GCG GGC ACC CCG GTC CTC ACC CTC 
Phe Ser Val Gin Ala Ala Asp Pro Ala Gly Thr Pro Val Leu Thr Leu 
3815 3820 3825 



11827 



50 



GAC GCA CTG CTG CTC CGC CCG GTG ACC CTG GGG AGG GCC GAC GCG CCG 11875 
Asp Ala Leu Leu Leu Arg Pro Val Thr Leu Gly Arg Ala Asp Ala Pro 
3830 3835 3840 

CAA CCG CTG TAC CGC GTC GAC TGG CAG CCG GTC GGC CAG GGG ACC GAG 11923 
Gin Pro Leu Tyr Arg Val Asp Trp Gin Pro Val Gly Gin Gly Thr Glu 
3845 3850 3855 



55 



GCC TCC GGC GCC CAG GGC TGG ACG GTG CTC GGG CAG GCC GCG GCC GAG 
Ala Ser Gly Ala Gin Gly Trp Thr Val Leu Gly Gin Ala Ala Ala Glu 



11971 
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3860 



3865 



3870 



ACG GTC GCG CAG CCC GCC GCC CAT GCG GAC CTC ACC GCC CTG CGT ACG 
Thr Val Ala Gin Pro Ala Ala His Ala Asp Leu Thr Ala Leu Arg Thr 
3875 3880 3885 3890 



12019 



10 



GCT GTG GCC GCG GCG GGA AC A CCC GTG CCC CGG CTG GTG GTC GTG TCG 12067 
Ala Val Ala Ala Ala Gly Thr Pro Val Pro Arg Leu Val Val Val Ser 
3895 3900 3905 

CCG GTG GAC ACC CGG CTG GAC GAG GGG CCG GTG CTG GCG GAC GCC GAG 12115 
Pro Val Asp Thr Arg Leu Asp Glu Gly Pro Val Leu Ala Asp Ala Glu 
3910 3915 3920 



75 



GCT CGG GCC CGT GCG GGT GAC GGC TGG GAC GAC GAT CCC CTA CGT GTC 
Ala Arg Ala Arg Ala Gly Asp Gly Trp Asp Asp Asp Pro Leu Arg Val 
3925 3930 3935 



12163 



20 



GCC CTC GGG CGC GGC CTG ACC CTG GTC CGG GAG TGG GTC GAG GAC GAA 
Ala Leu Gly Arg Gly Leu Thr Leu Val Arg Glu Trp Val Glu Asp Glu 
3940 3945 3950 



12211 



CGG TTG GCG GAC TCC CGG CTC GTC GTC CTC ACC CGT GGC GCG GTG GCG 
Arg Leu Ala Asp Ser Arg Leu Val Val Leu Thr Arg Gly Ala Val Ala 
3955 3960 3965 3970 



12259 



25 GCC GGT CCC GGC GAT GTG CCG GAC CTG ACA GGT GCG GCC CTG TGG GGG 

Ala Gly Pro Gly Asp Val Pro Asp Leu Thr Gly Ala Ala Leu Trp Gly 
3975 3980 3985 



12307 



CTG CTC CGC TCC GCG CAG TCG GAG TAT CCG GAC CGC TTC ACC CTC ATC 
30 Leu Leu Arg Ser Ala Gin Ser Glu Tyr Pro Asp Arg Phe Thr Leu lie 

3990 3995 4000 



12355 



35 



GAC GTG GAC GAT TCC CCC GAG TCC CGT GCG GCT CTG CCC CGG GCT CTG 12403 
Asp Val Asp Asp Ser Pro Glu Ser Arg Ala Ala Leu Pro Arg Ala Leu 
4005 4010 4015 

GGA TCG GCC GAG CGA CAA CTC GCC CTG CGG ACG GGC GAC GTG CTG GCG 12451 
Gly Ser Ala Glu Arg Gin Leu Ala Leu Arg Thr Gly Asp Val Leu Ala 
4020 4025 4030 



40 CCG GCC CTG GTC CCG ATG GCC ACC CGG CCG GCG GAG ACC ACT CCA GCG 

Pro Ala Leu Val Pro Met Ala Thr Arg Pro Ala Glu Thr Thr Pro Ala 
4035 4040 4045 4050 



12499 



ACG GCG GTC GCC TCG GCG ACA ACA CAG ACA CAG GTC ACC GCG CCC GCT 
Thr Ala Val Ala Ser Ala Thr Thr Gin Thr Gin Val Thr Ala Pro Ala 
4055 4060 4065 



12547 



so 



CCC GAC GAC CCG GCT GCG GAT GCC GTG TTC GAC CCG GCG GGC ACC GTA 12595 
Pro Asp Asp Pro Ala Ala Asp Ala Val Phe Asp Pro Ala Gly Thr Val 
4070 4075 4080 

CTG ATC ACC GGC GGC ACC GGC GCC CTG GGA CGG CGT GTC GCC TCG CAC 12643 
Leu lie Thr Gly Gly Thr Gly Ala Leu Gly Arg Arg Val Ala Ser His 
4085 4090 4095 



55 



CTC GCG CGC CGG TAC GGC GTA CGC CAC ATG CTT CTG GTC AGC AGG CGT 
Leu Ala Arg Arg Tyr Gly Val Arg His Met Leu Leu Val Ser Arg Arg 



12691 
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4100 4105 4110 

GGA CCG GAC GCC CCC GAG GCC GGT CCC CTG GAA CGG GAA CTC GCC GGT 
Gly Pro Asp Ala Pro Glu Ala Gly Pro Leu Glu Arg Glu Leu Ala Gly 
4115 4120 4125 4130 

CTC GGA GTC ACC GCC ACC TTC CTG GCA TGC GAC CTC ACC GAC ATC GAG 
Leu Gly Val Thr Ala Thr Phe Leu Ala Cys Asp Leu Thr Asp He Glu 
4135 4140 4145 

GCC GTA CGG AAG GCC GTC GCC GCG GTG CCG TCG GAC CAC CCG CTG ACC 
Ala Val Arg Lys Ala Val Ala Ala Val Pro Ser Asp His Pro Leu Thr 
4150 4155 4160 

GGT GTG GTG CAC ACC GCC GGC GTG CTG GAC GAC GGC GCC CTG ACC GGC 
Gly Val Val His Thr Ala Gly Val Leu Asp Asp Gly Ala Leu Thr Gly 
4165 4170 4175 

CTG ACC CGG CAA CGC CTC GAC ACC GTG CTG CGG CCC AAG GCC GAC GCC 
Leu Thr Arg Gin Arg Leu Asp Thr Val Leu Arg Pro Lys Ala Asp Ala 
4180 4185 4190 

GTG CGG AAC CTC CAC GAG GCG ACC CTC GAC CGG CCG CTG CGC GCG TTC 
Val Arg Asn Leu His Glu Ala Thr Leu Asp Arg Pro Leu Arg Ala Phe 
4195 4200 4205 4210 

GTC CTG TTC TCC GCC GCC GCC GGA CTC CTG GGC CGC CCC GGG CAG GCC 
Val Leu Phe Ser Ala Ala Ala Gly Leu Leu Gly Arg Pro Gly Gin Ala 
4215 4220 4225 

TCC TAC GCC GCC GCC AAC GCG GTC CTC GAC GCG CTC GCG GGA GCC CGC 
Ser Tyr Ala Ala Ala Asn Ala Val Leu Asp Ala Leu Ala Gly Ala Arg 
4230 4235 4240 

CGC GCG GCC GGA CTG CCC GCA GTG TCC CTG GCG TGG GGC CTG TGG GAC 
Arg Ala Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu Trp Asp 
4245 4250 4255 

GAG CAG ACG GGC ATG GCA GGA GGC CTC GAC GAG ATG GCC CTG CGC GTG 
Glu Gin Thr Gly Met Ala Gly Gly Leu Asp Glu Met Ala Leu Arg Val 
4260 4265 4270 

CTG CGC CGG GAC GGC ATC GCC GCG ATG CCT CCG GAG CAG GGG CTC GAA 
Leu Arg Arg Asp Gly He Ala Ala Met Pro Pro Glu Gin Gly Leu Glu 
4275 4280 4285 4290 

CTG CTC GAC CTG GCC CTG ACC GGA CAC CGG GAC GGA CCC GCC GTC CTC 
Leu Leu Asp Leu Ala Leu Thr Gly His Arg Asp Gly Pro Ala Val Leu 
4295 4300 4305 

GTC CCC CTC CTC CTC GAC GGC GCG GCC CTG CGC CGC ACG GCG AAG GAG 
Val Pro Leu Leu Leu Asp Gly Ala Ala Leu Arg Arg Thr Ala Lys Glu 
4310 4315 4320 

CGC GGC GCG GCC ACG ATG TCC CCC TTG CTG CGC GCC CTG CTG CCC GCC 
Arg Gly Ala Ala Thr Met Ser Pro Leu Leu Arg Ala Leu Leu Pro Ala 
4325 4330 4335 

GCC CTG CGC CGC AGC GGT GGA GCC GGC GCC CCC GCG GCG GCC GAC CGG 
Ala Leu Arg Arg Ser Gly Gly Ala Gly Ala Pro Ala Ala Ala Asp Arg 
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4340 



4345 



4350 



CAC GGC AAG GAG GCG GAC CCC GGT GCG GGA CGC CTC GCA GGG ATG GTG 
His Gly Lys Glu Ala Asp Pro Gly Ala Gly Arg Leu Ala Gly Met Val 
4355 4360 4365 4370 



13459 



10 



GCA CTC GAA GCG GCG GAG CGT TCC GCG GCC GTC CTT GAG CTG GTC ACC 13507 

Ala Leu Glu Ala Ala Glu Arg Ser Ala Ala Val Leu Glu Leu Val Thr 
4375 4380 4385 

GAA CAG GTC GCC GAG GTC CTC GGC TAC GCG TCG GCC GCG GAG ATC GAG 13555 

Glu Gin Val Ala Glu Val Leu Gly Tyr Ala Ser Ala Ala Glu lie Glu 
4390 4395 4400 



75 



CCC GAA CGA CCC TTC CGG GAG ATC GGC GTC GAC TCC CTG GCG GCG GTG 
Pro Glu Arg Pro Phe Arg Glu He Gly Val Asp Ser Leu Ala Ala Val 
4405 4410 4415 



13603 



20 



25 



GAG CTG CGC AAC CGG CTC AGC CGT CTG GTC GGC CTG CGG TTG CCG ACC 13651 
Glu Leu Arg Asn Arg Leu Ser Arg Leu Val Gly Leu Arg Leu Pro Thr 
4420 4425 4430 

ACG CTG TCC TTC GAC CAC CCC ACG CCG AAG GAC ATG GCG CAG CAC ATC 13699 
Thr Leu Ser Phe Asp His Pro Thr Pro Lys Asp Met Ala Gin His He 
4435 4440 4445 4450 



GAC GGG CAG CTC CCC CGC CCG GCC GGA GCC TCG CCC GCG GAC GCA GCG 13747 
Asp Gly Gin Leu Pro Arg Pro Ala Gly Ala Ser Pro Ala Asp Ala Ala 
4455 4460 4465 

CTG GAA GGG ATC GGC GAC CTC GCG CGG GCG GTC GCC CTG CTG GGC ACG 13795 
30 Leu Glu Gly He Gly Asp Leu Ala Arg Ala Val Ala Leu Leu Gly Thr 

4470 4475 4480 



35 



45 



GGC GAC GCC CGC CGG GCC GAG GTA CGA GAG CAG CTC GTC GGA CTG CTG 13843 
Gly Asp Ala Arg Arg Ala Glu Val Arg Glu Gin Leu Val Gly Leu Leu 
4485 4490 4495 

GCC GCG CTC GAC CCA CCT GGG CGG ACG GGC ACC GCC GCA CCC GGC GTC 13891 
Ala Ala Leu Asp Pro Pro Gly Arg Thr Gly Thr Ala Ala Pro Gly Val 
4500 4505 4510 



40 CCC TCC GGT GCC GAT GGC GCG GAA CCG ACC GTG ACG GAC CGG CTC GAC 

Pro Ser Gly Ala Asp Gly Ala Glu Pro Thr Val Thr Asp Arg Leu Asp 
4515 4520 4525 4530 



GAG GCG ACC GAC GAC GAG ATC TTC GCC TTC CTG GAC GAG CAG CTG TGA 
Glu Ala Thr Asp Asp Glu He Phe Ala Phe Leu Asp Glu Gin Leu * 
4535 4540 4545 



13939 



13987 



(2) INFORMATION FOR SEQ ID NO: 14: 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4546 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 



55 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Ser Gly Glu Leu Ala lie Ser Arg Ser Asp Asp Arg Ser Asp Ala 
15 10 15 

Val Ala Val Val Gly Met Ala Cys Arg Phe Pro Gly Ala Pro Gly He 
20 25 30 

Ala Glu Phe Trp Glu Leu Leu Arg Ser Gly Arg Gly Met Pro Thr Arg 
35 40 45 

Gin Asp Asp Gly Thr Trp Arg Ala Ala Leu Glu Asp His Ala Gly Phe 
50 55 60 

Asp Ala Gly Phe Phe Gly Met Asn Ala Arg Gin Ala Ala Ala Thr Asp 
65 70 75 80 

Pro Gin His Arg Leu Met Leu Glu Leu Gly Trp Glu Ala Leu Glu Asp 
85 90 95 

Ala Gly He Val Pro Gly Asp Leu Thr Gly Thr Asp Thr Gly Val Phe 
100 105 HO 

Ala Gly Val Ala Ser Asp Asp Tyr Ala Val Leu Thr Arg Arg Ser Ala 
115 120 125 

Val Ser Ala Gly Gly Tyr Thr Ala Thr Gly Leu His Arg Ala Leu Ala 
130 135 140 

Ala Asn Arg Leu Ser His Phe Leu Gly Leu Arg Gly Pro Ser Leu Val 
145 150 155 160 

Val Asp Ser Ala Gin Ser Ala Ser Leu Val Ala Val Gin Leu Ala Cys 
165 170 175 

Glu Ser Leu Arg Arg Gly Glu Thr Ser Leu Ala Val Ala Gly Gly Val 
180 185 190 

Asn Leu He Leu Thr Glu Glu Ser Thr Thr Val Met Glu Arg Met Gly 
195 200 205 

Ala Leu Ser Pro Asp Gly Arg Cys His Thr Phe Asp Ala Arg Ala Asn 
210 215 220 

Gly Tyr Val Arg Gly Glu Gly Gly Gly Ala Val Val Leu Lys Pro Leu 
225 230 235 240 

Asp Ala Ala Leu Ala Asp Gly Asp Arg Val Tyr Cys Val He Lys Gly 
245 250 255 

Gly Ala Val Asn Asn Asp Gly Gly Gly Ala Ser Leu Thr Thr Pro Asp 
260 265 270 

Arg Glu Ala Gin Glu Ala Val Leu Arg Gin Ala Tyr Arg Arg Ala Gly 
275 280 285 

Val Ser Thr Gly Ala Val Arg Tyr Val Glu Leu His Gly Thr Gly Thr 
290 295 300 

Arg Ala Gly Asp Pro Val Glu Ala Ala Ala Leu Gly Ala Val Leu Gly 
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305 



310 



315 



320 



Ala Gly Ala Asp Ser Gly Arg Ser Thr Pro Leu Ala Val Gly Ser Val 
325 330 335 

Lys Thr Asn Val Gly His Leu Glu Gly Ala Ala Gly lie Val Gly Leu 
340 345 350 

lie Lys Ala Thr Leu Cys Val Arg Lys Gly Glu Leu Val Pro Ser Leu 
355 360 365 

Asn Phe Ser Thr Pro Asn Pro Asp lie Pro Leu Asp Asp Leu Arg Leu 
370 375 380 

Arg Val Gin Thr Glu Arg Gin Glu Trp Asn Glu Glu Asp Asp Arg Pro 
385 390 395 400 

Arg Val Ala Gly Val Ser Ser Phe Gly Met Gly Gly Thr Asn Val His 
405 410 415 

Leu Val lie Ala Glu Ala Pro Ala Ala Ala Gly Ser Ser Gly Ala Gly 
420 425 430 

Gly Ser Gly Ala Gly Ser Gly Ala Gly lie Ser Ala Val Ser Gly Val 
435 440 445 

Val Pro Val Val Val Ser Gly Arg Ser Arg Val Val Val Arg Glu Ala 
450 455 460 

Ala Gly Arg Leu Ala Glu Val Val Glu Ala Gly Gly Val Gly Leu Ala 
465 470 475 480 

Asp Val Ala Val Thr Met Ala Asp Arg Ser Arg Phe Gly Tyr Arg Ala 
485 490 495 

Val Val Leu Ala Arg Gly Glu Ala Glu Leu Ala Gly Arg Leu Arg Ala 
500 505 510 

Leu Ala Gly Gly Asp Pro Asp Ala Gly Val Val Thr Gly Ala Val Leu 
515 520 525 

Asp Gly Gly Val Val Val Gly Ala Ala Pro Gly Gly Ala Gly Ala Ala 
530 535 540 

Gly Gly Ala Gly Ala Ala Gly Gly Ala Gly Gly Gly Gly Val Val Leu 
545 550 555 560 

Val Phe Pro Gly Gin Gly Thr Gin Trp Val Gly Met Gly Ala Gly Leu 
565 570 575 

Leu Gly Ser Ser Glu Val Phe Ala Ala Ser Met Arg Glu Cys Ala Arg 
580 585 590 

Ala Leu Ser Val His Val Gly Trp Asp Leu Leu Glu Val Val Ser Gly 
595 600 605 

Gly Ala Gly Leu Glu Arg Val Asp Val Val Gin Pro Val Thr Trp Ala 
610 615 620 



Val Met Val Ser Leu Ala Arg Tyr Trp Gin Ala Met Gly Val Asp Val 
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625 



630 



635 



640 



Ala Ala Val Val Gly His Ser Gin Gly Glu He Ala Ala Ala Thr Val 
645 650 655 

Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Ala Val Val Ala Leu Arg 

660 665 670 



70 



Ala Gly Leu He Gly Arg Tyr Leu Ala Gly Arg Gly Ala Met Ala Ala 
675 680 685 



Val Pro Leu Pro Ala Gly Glu Val Glu Ala Gly Leu Ala Lys Trp Pro 
690 695 700 



15 



Gly Val Glu Val Ala Ala Val Asn Gly Pro Ala Ser Thr Val Val Ser 
705 710 715 720 



Gly Asp Arg Arg Ala Val Ala Gly Tyr Val Ala Val Cys Gin Ala Glu 
725 730 735 



20 



Gly Val Gin Ala Arg Leu He Pro Val Asp Tyr Ala Ser His Ser Arg 
740 745 750 



His Val Glu Asp Leu Lys Gly Glu Leu Glu Arg Val Leu Ser Gly He 
755 760 765 



25 



Arg Pro Arg Ser Pro Arg Val Pro Val Cys Ser Thr Val Ala Gly Glu 
770 775 780 



30 



Gin Pro Gly Glu Pro Val Phe Asp Ala Gly Tyr Trp Phe Arg Asn Leu 

785 790 795 800 

Arg Asn Arg Val Glu Phe Ser Ala Val Val Gly Gly Leu Leu Glu Glu 

805 810 815 



35 



40 



Gly His Arg Arg Phe lie Glu Val Ser Ala His Pro Val Leu Val His 
820 825 830 

Ala He Glu Gin Thr Ala Glu Ala Ala Asp Arg Ser Val His Ala Thr 
835 840 845 

Gly Thr Leu Arg Arg Gin Asp Asp Ser Pro His Arg Leu Leu Thr Ser 
850 855 860 

Thr Ala Glu Ala Trp Ala His Gly Ala Thr Leu Thr Trp Asp Pro Ala 
865 870 875 880 



45 



Leu Pro Pro Gly His Leu Thr Thr Leu Pro Thr Tyr Pro Phe Asn His 
885 890 895 



His His Tyr Trp Leu Asp Thr He Asp Gly Gly Gly Gly Asp Asp Ala 
900 90S 910 



50 



Thr Gin Glu Lys Glu Ser Gly Pro Leu Thr Arg Glu Leu Arg Gly Leu 
915 920 925 



Pro Ser Ser Gin Lys Gin Leu Gly Phe Leu Leu Asp Leu Val Cys Arg 
930 935 940 



55 



His Thr Ala Val Val Leu Gly Leu Asp Thr Ala Ala Glu Val Asp Pro 
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945 950 955 960 

Asp Leu Ser Phe Lys Lys Gin Gly He Gin Ser Met Thr Gly Val Glu 
965 970 975 

Leu Arg Asn Arg Leu Leu Thr Glu Thr Gly Leu Ala Leu Pro Thr Thr 
980 985 990 

Leu Val Tyr Asp Arg Pro Thr Pro Arg Ala Leu Ala Gin Phe Leu His 
995 1000 1005 

Thr Glu Leu Leu Asp Gly Ser Pro Ser Gly Ser Val Leu Ala Pro Ala 
1010 1015 1020 

Gin Lys Ser Phe Glu Ala Gly Gly Pro Gly Val Leu Ser Ser Ala Ala 
1025 1030 1035 1040 

Val Gly Val Ser Asp Ala Arg Gly Gly Ser Arg Asp Asp Asp Asp Pro 
1045 1050 1055 

He Ala He Val Gly Val Gly Cys Arg Leu Pro Gly Gly Val Asp Ser 
1060 1065 1070 

Arg Ala Ala Leu Trp Glu Leu Leu Glu Ser Gly Ala Asp Ala He Ser 
1075 1080 1085 

Ser Phe Pro Thr Asp Arg Gly Trp Asp Leu Asp Gly Leu Tyr Asp Pro 
1090 1095 HOO 

Glu Pro Gly Thr Pro Gly Lys Thr Tyr Val Arg Glu Gly Gly Phe Leu 
1105 1110 1115 1120 

His Ser Ala Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg 
1125 H30 1135 

Glu Ala Thr Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Ala Ser 
1140 H45 1150 

Trp Glu Ala Leu Glu Asp Ala Gly Val Leu Pro Glu Ser Leu Arg Gly 
1155 1160 H65 

Gly Asp Ala Gly Val Phe Val Gly Ala Thr Ala Pro Glu Tyr Gly Pro 
1170 H75 1180 

Arg Leu His Glu Gly Ala Asp Gly Tyr Glu Gly Tyr Leu Leu Thr Gly 
1185 1190 H95 1200 

Thr Thr Ala Ser Val Ala Ser Gly Arg He Ala Tyr Thr Leu Gly Thr 
1205 1210 1215 

Gly Gly Pro Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val 
1220 1225 1230 

Ala Leu His Leu Ala Val Gin Ala Leu Arg Arg Gly Glu Cys Gly Leu 
1235 1240 1245 

Ala Leu Ala Gly Gly Ala Thr Val Met Ser Gly Pro Gly Met Phe Val 
1250 1255 1260 

Glu Phe Ser Arg Gin Arg Gly Leu Ala Pro Asp Gly Arg Cys Met Pro 
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1265 1270 1275 1280 

Phe Ser Ala Asp Ala Asp Gly Thr Ala Trp Ser Glu Gly Val Ala Val 
1285 1290 1295 

Leu Ala Leu Glu Arg Leu Ser Asp Ala Arg Arg Ala Gly His Arg Val 
1300 1305 1310 

Leu Gly Val Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn 
1315 1320 1325 

Gly Leu Thr Ala Pro Asn Arg Ser Ala Gin Glu Gly Val lie Arg Ala 
1330 1335 1340 

Ala Leu Ala Asp Ala Gly Leu Ala Pro Gly Asp Val Asp Ala Val Glu 
1345 1350 1355 1360 

Ala His Gly Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala Ser Ala 
1365 1370 1375 

Leu Leu Ala Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu 
1380 1385 1390 

Gly Ser Leu Lys Ser Asn Val Gly His Thr Gin Ala Ala Ala Gly Ala 
1395 1400 1405 

Ala Gly Val Val Lys Met Leu Leu Ala Leu Glu His Gly Thr Leu Pro 
1410 1415 1420 

Arg Thr Leu His Ala Asp Arg Pro Ser Thr His Val Asp Trp Ser Ser 
1425 1430 1435 1440 

Gly Thr Val Ala Leu Leu Ala Glu Ala Arg Arg Trp Pro Arg Arg Ser 
1445 1450 1455 

Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly He Ser Gly Thr 
1460 1465 1470 

Asn Ala His Leu He He Glu Glu Ala Pro Glu Trp Val Glu Asp He 
1475 1480 1485 

Asp Gly Val Ala Ala Pro Asp Arg Gly Thr Ala Asp Ala Ala Ala Pro 
1490 1495 1500 

Ser Pro Leu Leu Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin 
1505 1510 1515 1520 

Ala Val Arg Leu Gly Glu Tyr Val Glu Arg Val Gly Ala Asp Pro Arg 
1525 1530 1535 

Asp Val Ala Tyr Ser Leu Ala Ser Thr Arg Thr Leu Phe Glu His Arg 
1540 1545 1550 

Ala Val Val Pro Cys Gly Gly Arg Gly Glu Leu Val Ala Ala Leu Gly 
1555 1560 1565 

Gly Phe Ala Ala Gly Arg Val Ser Gly Gly Val Arg Ser Gly Arg Ala 
1570 1575 1580 

Val Pro Gly Gly Val Gly Val Leu Phe Thr Gly Gin Gly Ala Gin Trp 
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1585 1590 1595 1600 

Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly Val Phe Ala Glu 
1605 1610 1615 

Val Leu Asp Glu Val Leu Ser Met Val Gly Glu Val Asp Gly Arg Ser 
1620 1625 1630 

Leu Arg Asp Val Met Phe Gly Asp Val Asp Val Asp Ala Gly Ala Gly 
1635 1640 1645 

Ala Asp Ala Gly Ala Gly Ala Gly Ala Gly Val Gly Ser Gly Ser Gly 
1650 1655 1660 

Ser Val Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu 
1665 1670 1675 1680 

Phe Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val 
1685 1690 1695 

Glu Val Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala 
1700 1705 1710 

Tyr Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val 
1715 1720 1725 

Ala Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp 
1730 1735 1740 

Ser Val Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu 
1745 1750 1755 1760 

Gly Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val 
1765 1770 1775 

Leu Ser Gly Asp Val Gly Val Leu Glu Ser Val Val Ala Ser Leu Met 
1780 1785 1790 

Gly Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser His Gly Phe His 
1795 1800 1805 

Ser Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu 
1810 1815 1820 

Ser Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly 
1825 1830 1835 1840 

Val Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr 
1845 1850 1855 

Trp Val Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly 
I860 1865 1870 

Val Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His 
1875 1880 1885 

Gly Val Leu Thr Gly Met Ala Gly Glu Cys Leu Gly Ala Gly Asp Asp 
1890 1895 1900 

Val Val Val Val Pro Ala Met Arg Arg Gly Arg Ala Glu Arg Glu Val 
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1905 1910 1915 1920 

Phe Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp 
1925 1930 1935 

Ala Thr Ala Leu His Thr Gly Ser Thr Gly Arg Arg He Asp Leu Pro 
1940 1945 1950 

Thr Tyr Pro Phe Gin Arg Asp Arg Tyr Trp Leu Asp Pro Val Arg Thr 
1955 I960 1965 

Ala Val Thr Gly Val Glu Pro Ala Gly Ser Pro Ala Asp Ala Arg Ala 
1970 1975 1980 

Thr Glu Arg Gly Arg Ser Thr Thr Ala Gly He Arg Tyr Arg Val Ala 
1985 1990 1995 2000 

Trp Gin Pro Ala Val Val Asp Arg Gly Asn Pro Gly Pro Ala Gly His 
2005 2010 2015 

Val Leu Leu Leu Ala Pro Asp Glu Asp Thr Ala Asp Ser Gly Leu Ala 
2020 2025 2030 

Pro Ala He Ala Arg Glu Leu Ala Val Arg Gly Ala Glu Val His Thr 
2035 2040 2045 

Val Ala Val Pro Val Gly Thr Gly Arg Glu Ala Ala Gly Asp Leu Leu 
2050 2055 2060 

Arg Ala Ala Gly Asp Gly Ala Ala Arg Ser Thr Arg Val Leu Trp Leu 
2065 2070 2075 2080 

Ala Pro Ala Glu Pro Asp Ala Ala Asp Ala Val Ala Leu Val Gin Ala 
2085 2090 2095 

Leu Gly Glu Ala Val Pro Glu Ala Pro Leu Trp He Thr Thr Arg Glu 
2100 2105 2110 

Ala Ala Ala Val Arg Pro Asp Glu Thr Pro Ser Val Gly Gly Ala Gin 
2115 2120 2125 

Leu Trp Gly Leu Gly Gin Val Ala Ala Leu Glu Leu Gly Arg Arg Trp 
2130 2135 2140 

Gly Gly Leu Ala Asp Leu Pro Gly Ser Ala Ser Pro Ala Val Leu Arg 
2145 2150 2155 2160 

Thr Phe Val Gly Ala Leu Leu Ala Gly Gly Glu Asn Gin Phe Ala Val 
2165 2170 2175 

Arg Pro Ser Gly Val His Val Arg Arg Val Val Pro Ala Pro Val Pro 
2180 2185 2190 

Val Pro Ala Ser Ala Arg Thr Val Thr Thr Ala Pro Ala Thr Ala Val 
2195 2200 2205 

Gly Glu Asp Ala Arg Asn Asp Thr Ser Asp Val Val Val Pro Asp Asp 
2210 2215 2220 

Arg Trp Ser Ser Gly Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu 
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2225 2230 2235 2240 

Gly Ala Gin Val Ala Arg Arg Leu Ala Arg Ser Gly Ala Ala Arg Leu 
2245 2250 2255 

Leu Leu Val Gly Arg Arg Gly Ala Ala Gly Pro Gly Val Gly Glu Leu 
2260 2265 2270 

Val Glu Glu Leu Thr Ala Leu Gly Ser Glu Val Ala Val Glu Ala Cys 
2275 2280 2285 

Asp Val Ala Asp Arg Asp Ala Leu Ala Ala Leu Leu Ala Gly Leu Pro 
2290 2295 2300 

Glu Glu Arg Pro Leu Val Ala Val Leu His Ala Ala Gly Val Leu Asp 
2305 2310 2315 2320 

Asp Gly Val Leu Asp Ser Leu Thr Ser Asp Arg Val Asp Ala Val Leu 
2325 2330 2335 

Arg Asp Lys Val Thr Ala Ala Arg His Leu Asp Glu Leu Thr Ala Asp 
2340 2345 2350 

Leu Pro Leu Asp Ala Phe Val Leu Phe Ser Ser He Val Gly Val Trp 
2355 2360 2365 

Gly Asn Gly Gly Gin Ala Val Tyr Ala Ala Ala Asn Ala Ala Leu Asp 
2370 2375 2380 

Ala Leu Ala Gin Arg Arg Arg Ala Arg Gly Ala Arg Ala Ala Ser He 
2385 2390 2395 2400 

Ala Trp Gly Pro Trp Ala Gly Ala Gly Met Ala Ser Gly Thr Ala Ala 
2405 2410 2415 

Lys Ser Phe Glu Arg Asp Gly Val Thr Ala Leu Asp Pro Glu Arg Ala 
2420 2425 2430 

Leu Asp Val Leu Asp Asp Val Val Gly Ala Gly Gly Thr Ser Ala Ala 
2435 2440 2445 

Gly Thr His Ala Ala Gly Glu Ser Ser Leu Leu Val Ala Asp Val Asp 
2450 2455 2460 

Trp Glu Thr Phe Val Gly Arg Ser Val Thr Arg Arg Thr Trp Ser Leu 
2465 2470 2475 2480 

Phe Asp Gly Val Ser Ala Ala Arg Ser Ala Arg Ala Gly His Ala Ala 
2485 2490 2495 

Asp Asp Arg Ala Ala Leu Thr Pro Gly Thr Arg Pro Gly Asp Gly Ala 
2500 2505 2510 

Pro Gly Gly Ser Gly Gin Asp Gly Gly Glu Gly Arg Pro Trp Leu Ser 
2515 2520 2525 

Val Gly Pro Ser Pro Ala Glu Arg Arg Arg Ala Leu Leu Thr Leu Val 
2530 2535 2540 

Arg Ser Glu Ala Ala Gly He Leu Arg His Ala Ser Ala Asp Ala Val 
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2545 2550 2555 2560 

Asp Pro Glu Leu Ala Phe Arg Ser Ala Gly Phe Asp Ser Leu Thr Val 
2565 2570 2575 

Lou Glu Leu Arg Asn Arg Leu Thr Ala Ala Thr Gly Leu Asn Leu Pro 
2580 2585 2590 

Asn Thr Leu Leu Phe Asp His Pro Thr Pro Leu Ser Leu Ala Ser His 
2595 2600 2605 

Leu His Asp Glu Leu Phe Gly Pro Asp Ser Glu Ala Glu Pro Ala Ala 
2610 2615 2620 

Ala Ala Pro Thr Pro Val Met Ala Asp Glu Arg Glu Pro He Ala He 
2625 2630 2635 2640 

Val Gly Met Ala Cys Arg Tyr Pro Gly Gly Val Ala Ser Pro Asp Asp 
2645 2650 2655 

Leu Trp Asp Leu Val Ala Gly Asp Gly His Thr Leu Ser Pro Phe Pro 
2660 2665 2670 

Ala Asp Arg Gly Trp Asp Val Glu Gly Leu Tyr Asp Pro Glu Pro Gly 
2675 2680 2685 

Val Pro Gly Lys Ser Tyr Val Arg Glu Gly Gly Phe Leu Arg Ser Ala 
2690 2695 2700 

Ala Glu Phe Asp Ala Glu Phe Phe Gly He Ser Pro Arg Glu Ala Thr 
2705 2710 2715 2720 

Ala Met Asp Pro Gin Gin Arg Leu Leu Leu Glu Thr Ser Trp Glu Ala 
2725 2730 2735 

Leu Glu Arg Ala Gly He Val Pro Asp Ser Leu Arg Gly Thr Arg Thr 
2740 2745 2750 

Gly Val Phe Ser Gly He Ser Gin Gin Asp Tyr Ala Thr Gin Leu Gly 
2755 2760 2765 

Asp Ala Ala Asp Thr Tyr Gly Gly His Val Leu Thr Gly Thr Leu Gly 
2770 2775 2780 

Ser Val lie Ser Gly Arg Val Ala Tyr Ala Leu Gly Leu Glu Gly Pro 
2785 2790 2795 2800 

Ala Leu Thr Val Asp Thr Ala Cys Ser Ser Ser Leu Val Ala Leu His 
2805 2810 2815 

Leu Ala Val Gin Ser Leu Arg Arg Gly Glu Cys Asp Leu Ala Leu Ala 
2820 2825 2830 

Gly Gly Val Thr Val Met Ala Thr Pro Thr Val Phe Val Glu Phe Ser 
2835 2840 2845 

Arg Gin Arg Gly Leu Ala Ala Asp Gly Arg Cys Lys Ala Phe Ala Glu 
2850 2855 2860 

Gly Ala Asp Gly Thr Ala Trp Ala Glu Gly Val Gly Val Leu Leu Val 
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2865 2870 2875 2880 

Glu Arg Leu Ser Asp Ala Arg Arg Asn Gly His Arg Val Leu Ala Val 
2885 2890 2895 

Val Arg Gly Ser Ala Val Asn Gin Asp Gly Ala Ser Asn Gly Leu Thr 
2900 2905 2910 

Ala Pro Ser Gly Pro Ala Gin Gin Arg Val lie Arg Glu Ala Leu Ala 
2915 2920 2925 

Asp Ala Gly Leu Val Pro Ala Asp Val Asp Val Val Glu Ala His Gly 
2930 2935 2940 

Thr Gly Thr Ala Leu Gly Asp Pro He Glu Ala Gly Ala Leu Leu Ala 
2945 2950 2955 2960 

Thr Tyr Gly Arg Glu Arg Val Gly Asp Pro Leu Trp Leu Gly Ser Leu 
2965 2970 2975 

Lys Ser Asn He Gly His Ala Gin Ala Ala Ala Gly Val Gly Gly Val 
2980 2985 2990 

He Lys Val Val Gin Gly Met Arg His Gly Ser Leu Pro Arg Thr Leu 
2995 3000 3005 

His Val Asp Ala Pro Ser Ser Lys Val Glu Trp Ala Ser Gly Ala Val 
3010 3015 3020 

Glu Leu Leu Thr Glu Thr Arg Ser Trp Pro Arg Arg Val Glu Arg Val 
3025 3030 3035 3040 

Arg Arg Ala Ala Val Ser Ala Phe Gly Val Ser Gly Thr Asn Ala His 
3045 3050 3055 

Val Val Leu Glu Glu Ala Pro Ala Glu Ala Gly Ser Glu His Gly Asp 
3060 3065 3070 

Gly Pro Glu Pro Glu Arg Pro Asp Ala Val Thr Gly Pro Leu Ser Trp 
3075 3080 3085 

Val Leu Ser Ala Arg Ser Glu Gly Ala Leu Arg Ala Gin Ala Val Arg 
3090 3095 3100 

Leu Arg Glu Cys Val Glu Arg Val Gly Ala Asp Pro Arg Asp Val Ala 
3105 3110 3115 3120 

Gly Ser Leu Val Val Ser Arg Ala Ser Phe Gly Glu Arg Ala Val Val 
3125 3130 3135 

Val Gly Arg Gly Arg Glu Glu Leu Leu Ala Gly Leu Asp Val Val Ala 
3140 3145 3150 

Ala Gly Ala Pro Val Gly Val Ser Ser Gly Ala Gly Ala Val Val Arg 
3155 3160 3165 

Gly Ser Ala Val Arg Gly Arg Gly Val Gly Val Leu Phe Thr Gly Gin 
3170 3175 3180 

Gly Ala Gin Trp Val Gly Met Gly Arg Gly Leu Tyr Ala Gly Gly Gly 
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3185 3190 3195 3200 

Val Phe Ala Glu Val Leu Asp Glu Val Leu Ser Val Val Gly Glu Val 
3205 3210 3215 

Asp Gly Arg Ser Leu Arg Asp Val Met Phe Ala Asp Ala Asp Ser Val 
3220 3225 3230 

Leu Gly Gly Leu Leu Gly Arg Thr Glu Phe Ala Gin Pro Ala Leu Phe 
3235 3240 3245 

Ala Leu Glu Val Ala Leu Phe Arg Ala Leu Glu Ala Arg Gly Val Glu 
3250 3255 3260 

Val Ser Val Val Leu Gly His Ser Val Gly Glu Val Ala Ala Ala Tyr 
3265 3270 3275 3280 

Val Ala Gly Val Leu Ser Leu Gly Asp Ala Val Arg Leu Val Val Ala 
3285 3290 3295 

Arg Gly Gly Leu Met Gly Gly Leu Pro Val Gly Gly Gly Met Trp Ser 
3300 3305 3310 

Val Gly Ala Ser Glu Ser Val Val Arg Gly Val Val Glu Gly Leu Gly 
3315 3320 3325 

Glu Trp Val Ser Val Ala Ala Val Asn Gly Pro Arg Ser Val Val Leu 
3330 3335 3340 

Ser Gly Asp Val Gly Val Leu Glu Ser Val Val Val Thr Leu Met Gly 
3345 3350 3355 3360 

Asp Gly Val Glu Cys Arg Arg Leu Asp Val Ser Hi3 Gly Phe His Ser 
3365 3370 3375 

Val Leu Met Glu Pro Val Leu Gly Glu Phe Arg Gly Val Val Glu Ser 
3380 3385 3390 

Leu Glu Phe Gly Arg Val Arg Pro Gly Val Val Val Val Ser Gly Val 
3395 3400 3405 

Ser Gly Gly Val Val Gly Ser Gly Glu Leu Gly Asp Pro Gly Tyr Trp 
3410 3415 3420 

Val Arg His Ala Arg Glu Ala Val Arg Phe Ala Asp Gly Val Gly Val 
3425 3430 3435 3440 

Val Arg Gly Leu Gly Val Gly Thr Leu Val Glu Val Gly Pro His Gly 
3445 3450 3455 

Val Leu Thr Gly Met Ala Gly Gin Cys Leu Glu Ala Gly Asp Asp Val 
3460 3465 3470 

Val Val Val Pro Ala Met Arg Arg Gly Arg Pro Glu Arg Glu Val Phe 
3475 3480 3485 

Glu Ala Ala Leu Ala Thr Val Phe Thr Arg Asp Ala Gly Leu Asp Ala 
3490 3495 3500 

Thr Thr Leu His Thr Gly Ser Thr Gly Arg Arg lie Asp Leu Pro Thr 
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3505 



3510 



3515 



3520 



Tyr Pro Phe Gin His Asn Arg Tyr Trp Ala Thr Gly Ser Val Thr Gly 
3525 3530 3535 

Ala Thr Gly Thr Ser Ala Ala Ala Arg Phe Gly Leu Glu Trp Lys Asp 
3540 3545 3550 

His Pro Phe Leu Ser Gly Ala Thr Pro He Ala Gly Ser Gly Ala Leu 
3555 3560 3565 

Leu Leu Thr Gly Arg Val Gly Leu Ala Ala His Pro Trp Leu Ala Asp 
3570 3575 3580 

His Ala He Ser Gly Thr Val Leu Leu Pro Gly Thr Ala He Ala Asp 
3585 3590 3595 360( 

Leu Leu Leu Arg Ala Val Glu Glu Val Gly Ala Gly Gly Val Glu Glu 
3605 3610 3615 

Leu Thr Leu His Glu Pro Leu Leu Leu Pro Glu Arg Gly Gly Leu His 
3620 3625 3630 

Val Gin Val Leu Val Glu Ala Ala Asp Glu Gin Gly Arg Arg Ala Val 
3635 3640 3645 

Ala Val Ala Ala Arg Pro Glu Gly Pro Gly Arg Asp Gly Glu Glu Gin 
3650 3655 3660 

Glu Trp Thr Arg His Ala Glu Gly Val Leu Thr Ser Thr Glu Thr Ala 
3665 3670 3675 368C 

Val Pro Asp Met Gly Trp Ala Ala Gly Ala Trp Pro Pro Pro Gly Ala 
3685 3690 3695 

Glu Pro He Asp Val Glu Glu Leu Tyr Asp Ala Phe Ala Ala Asp Gly 
3700 3705 3710 

Tyr Gly Tyr Gly Pro Ala Phe Thr Ala Leu Ser Gly Val Trp Arg Leu 
3715 3720 3725 

Gly Asp Glu Leu Phe Ala Glu Val Arg Arg Pro Ala Gly Gly Ala Gly 
3730 3735 3740 

Thr Thr Gly Asp Gly Phe Gly Val His Pro Ala Leu Phe Asp Ala Ala 
3745 3750 3755 3760 

Leu His Pro Trp Arg Ala Gly Gly Leu Leu Pro Asp Thr Gly Gly Thr 
3765 3770 3775 

Thr Trp Ala Pro Phe Ser Trp Gin Gly He Ala Leu His Thr Thr Gly 
3780 3785 3790 

Ala Glu Thr Leu Arg Val Arg Leu Ala Pro Ala Ala Gly Gly Thr Glu 
3795 3800 3805 

Ser Ala Phe Ser Val Gin Ala Ala Asp Pro Ala Gly Thr Pro Val Leu 



3810 



3815 



3820 



Thr Leu Asp Ala Leu 



Leu 



Leu Arg Pro Val Thr 



Leu Gly Arg Ala Asp 
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3825 3830 3835 3840 

Ala Pro Gin Pro Leu Tyr Arg Val Asp Trp Gin Pro Val Gly Gin Gly 
3845 3850 3855 

Thr Glu Ala Ser Gly Ala Gin Gly Trp Thr Val Leu Gly Gin Ala Ala 
3860 3865 3870 

Ala Glu Thr Val Ala Gin Pro Ala Ala His Ala Asp Leu Thr Ala Leu 
3875 3880 3885 

Arg Thr Ala Val Ala Ala Ala Gly Thr Pro Val Pro Arg Leu Val Val 
3890 3895 3900 

Val Ser Pro Val Asp Thr Arg Leu Asp Glu Gly Pro Val Leu Ala Asp 
3905 3910 3915 3920 

Ala Glu Ala Arg Ala Arg Ala Gly Asp Gly Trp Asp Asp Asp Pro Leu 
3925 3930 3935 

Arg Val Ala Leu Gly Arg Gly Leu Thr Leu Val Arg Glu Trp Val Glu 
3940 3945 3950 

Asp Glu Arg Leu Ala Asp Ser Arg Leu Val Val Leu Thr Arg Gly Ala 
3955 3960 3965 

Val Ala Ala Gly Pro Gly Asp Val Pro Asp Leu Thr Gly Ala Ala Leu 
3970 3975 3980 

Trp Gly Leu Leu Arg Ser Ala Gin Ser Glu Tyr Pro Asp Arg Phe Thr 
3985 3990 3995 4000 

Leu He Asp Val Asp Asp Ser Pro Glu Ser Arg Ala Ala Leu Pro Arg 
4005 4010 4015 

Ala Leu Gly Ser Ala Glu Arg Gin Leu Ala Leu Arg Thr Gly Asp Val 
4020 4025 4030 

Leu Ala Pro Ala Leu Val Pro Met Ala Thr Arg Pro Ala Glu Thr Thr 
4035 4040 4045 

Pro Ala Thr Ala Val Ala Ser Ala Thr Thr Gin Thr Gin Val Thr Ala 
4050 4055 4060 

Pro Ala Pro Asp Asp Pro Ala Ala Asp Ala Val Phe Asp Pro Ala Gly 
4065 4070 4075 4080 

Thr Val Leu He Thr Gly Gly Thr Gly Ala Leu Gly Arg Arg Val Ala 
4085 4090 4095 

Ser His Leu Ala Arg Arg Tyr Gly Val Arg His Met Leu Leu Val Ser 
4100 4105 4110 



Arg Arg Gly Pro Asp 
4115 

Ala Gly Leu Gly Val 
4130 

He Glu Ala Val Arg 



Pro Glu Ala Gly Pro 
4120 



Ala Thr Phe Leu Ala 
4135 



Ala Val Ala Ala Val 



Leu Glu Arg Glu Leu 
4125 

Cys Asp Leu Thr Asp 
4140 

Pro Ser Asp His Pro 



Ala 
Thr 
Lys 
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4145 4150 4155 4160 

Leu Thr Gly Val Val His Thr Ala Gly Val Leu Asp Asp Gly Ala Leu 
4165 4170 4175 

Thr Gly Leu Thr Arg Gin Arg Leu Asp Thr Val Leu Arg Pro Lys Ala 
4180 4185 4190 

Asp Ala Val Arg Asn Leu His Glu Ala Thr Leu Asp Arg Pro Leu Arg 
4195 4200 4205 

Ala Phe Val Leu Phe Ser Ala Ala Ala Gly Leu Leu Gly Arg Pro Gly 
4210 4215 4220 

Gin Ala Ser Tyr Ala Ala Ala Asn Ala Val Leu Asp Ala Leu Ala Gly 
4225 4230 4235 4240 

Ala Arg Arg Ala Ala Gly Leu Pro Ala Val Ser Leu Ala Trp Gly Leu 
4245 4250 4255 

Trp Asp Glu Gin Thr Gly Met Ala Gly Gly Leu Asp Glu Met Ala Leu 
4260 4265 4270 

Arg Val Leu Arg Arg Asp Gly He Ala Ala Met Pro Pro Glu Gin Gly 
4275 4280 4285 

Leu Glu Leu Leu Asp Leu Ala Leu Thr Gly His Arg Asp Gly Pro Ala 
4290 4295 4300 
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Val Leu Val Pro Leu Leu Leu Asp Gly Ala Ala Leu Arg Arg Thr Ala 
4305 4310 4315 4320 

Lys Glu Arg Gly Ala Ala Thr Met Ser Pro Leu Leu Arg Ala Leu Leu 
4325 4330 4335 

Pro Ala Ala Leu Arg Arg Ser Gly Gly Ala Gly Ala Pro Ala Ala Ala 
4340 4345 4350 

Asp Arg His Gly Lys Glu Ala Asp Pro Gly Ala Gly Arg Leu Ala Gly 
4355 4360 4365 

Met Val Ala Leu Glu Ala Ala Glu Arg Ser Ala Ala Val Leu Glu Leu 
4370 4375 4380 

Val Thr Glu Gin Val Ala Glu Val Leu Gly Tyr Ala Ser Ala Ala Glu 
4385 4390 4395 4400 

He Glu Pro Glu Arg Pro Phe Arg Glu He Gly Val Asp Ser Leu Ala 
4405 4410 4415 

Ala Val Glu Leu Arg Asn Arg Leu Ser Arg Leu Val Gly Leu Arg Leu 
4420 4425 4430 

Pro Thr Thr Leu Ser Phe Asp His Pro Thr Pro Lys Asp Met Ala Gin 
25 4435 4440 4445 

His He Asp Gly Gin Leu Pro Arg Pro Ala Gly Ala Ser Pro Ala Asp 
4450 4455 4460 

30 Ala Ala Leu Glu Gly He Gly Asp Leu Ala Arg Ala Val Ala Leu Leu 

4465 4470 4475 4480 

Gly Thr Gly Asp Ala Arg Arg Ala Glu Val Arg Glu Gin Leu Val Gly 
4485 4490 4495 



10 



15 



20 



35 



40 



55 



Leu Leu Ala Ala Leu Asp Pro Pro Gly Arg Thr Gly Thr Ala Ala Pro 
4500 4505 4510 

Gly Val Pro Ser Gly Ala Asp Gly Ala Glu Pro Thr Val Thr Asp Arg 
4515 4520 4525 

Leu Asp Glu Ala Thr Asp Asp Glu He Phe Ala Phe Leu Asp Glu Gin 
4530 4535 4540 



Leu 

45 4 5 4 5 



50 
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(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CGAGGCCGGC GGGCC 



Claims 

20 1. A DNA molecule comprising an isolated DNA sequence that encodes a tylactone synthase domain. 

2. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 

of: 

nucleotides 942 to 21 56, 2571 to 3557, 3675 to 3929, 3993 to 5264, 5631 to 6617, 7410 to 7949, 8220 to 
25 8471, 8541 to 9812, 10260 to 11246, 11319 to 11876, 12861 to 13415, 13719 to 13970, 14411 to 15697, 16055 

to 17122, 17198 to 17794, 18584 to 19138, 19415 to 19666, 20136 to 21404, 21771 to 22757, 23541 to 24077, 
24360 to 24611, 24675 to 25949, 26292 to 27284, 27360 to 27917, 28767 to 29813, 29829 to 30368, 30651 to 
30902, 31337 to 32608, 32975 to 33961, 34694 to 35236, 35492 to 35743, 36360 to 37631, 37989 to 38987, 
39759 to 40313, 40575 to 40826, and 41235 to 41333 all in SEQ ID NO:1. 

30 

3. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 

of: 

nucleotides 942 to 8471, 8541 to 13970, 14411 to 19666, 20136 to 24611, 24675 to 30902, 31337 to 35743, 
and 36360 to 40826 all in SEQ ID NO:1 . 

35 

4. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is selected from the group consisting 
of: 

nucleotides 81 6 to 1 4234, 1 4351 to 1 9945, 2001 0 to 31 1 99, 31 232 to 36067, and 36249 to 41 774 all in SEQ 
IDNO:1. 

40 

5. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is SEQ ID NO:1 . 

6. A polypeptide comprising an amino acid sequence that consists of a tylactone synthase domain. 

45 7. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of: 

(a) amino acids 43 to 447, 586 to 914, 954 to 1038, 1060 to 1483, 1606 to 1 934, 21 99 to 2378, 2469 to 2552, 
2576 to 2999, 3149 to 3477, 3502 to 3687, 4016 to 4200, and 4302 to 4385 in SEQ ID NO:2; 

(b) amino acids 21 to 449, 569 to 924, 950 to 1148, 1412 to 1596, and 1689 to 1772 in SEQ ID NO:3; 

50 ( C ) amino acids 43 to 465, 588 to 916, 1178 to 1356, and 1451 to 1534, 1556 to 1980, 2095 to 2425, 2451 to 

2636, 3274 to 3453, and 3548 to 3631 in SEQ ID NO:4; 

(d) amino acids 36 to 459, 582 to 910, 1155 to 1335, and 1421 to 1504 in SEQ ID NO:5; and 

(e) amino acids 38 to 461, 581 to 913, 1171 to 1355, 1443 to 1526, and 1663 to 1695 in SEQ ID NO:6. 

55 8. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of: 

(a) amino acids 1060 to 2552 and 2576 to 4385 in SEQ ID NO:2; 

(b) amino acids 21 to 1772 in SEQ ID NO:3; 
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(c) amino acids 43 to 1534 and 1556 to 3631 in SEQ ID NO:4; 

(d) amino acids 36 to 1504 in SEQ ID NO:5; and 

(e) amino acids 38 to 1526 in SEQ ID NO:6. 

9. The polypeptide of claim 6 wherein the amino acid sequence is selected from the group consisting of SEQ ID NO: 
2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, and SEQ ID NO:6. 

10. A recombinant DNA vector comprising a DNA molecule of Claim 1 . 

11. A recombinant DNA vector comprising a DNA molecule of Claim 2. 

12. A recombinant DNA vector comprising a DNA molecule of Claim 3. 

13. A recombinant DNA vector comprising a DNA molecule of Claim 4. 

14. A recombinant DNA vector comprising a DNA molecule of Claim 5. 

15. A recombinant DNA vector of Claim 10 which is NRRL B-18688. 

16. A recombinant DNA vector of Claim 10 which is NRRL B-18689 

17. A host cell transformed with a recombinant DNA vector of Claim 10. 

1 8. A host cell transformed with a recombinant DNA vector of Claim 11 . 

19. A host cell transformed with a recombinant DNA vector of Claim 12. 

20. A host cell transformed with a recombinant DNA vector of Claim 1 3. 

21. A host cell transformed with a recombinant DNA vector of Claim 14. 

22. The DNA molecule as claimed in Claim 1 wherein the isolated DNA sequence is SEQ ID NO:7. 

23. The polypeptide of SEQ ID NO:8. 
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Fig. 6 
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